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Preface 



Search Theory is one of the original disciplines within the field of Operations Research. 
It deals with the problem faced by a Searcher who wishes to minimize the time required 
to find a hidden object, or “target.” The Searcher chooses a path in the “search space” and 
finds the target when he is sufficiently close to it. Traditionally, the target is assumed to 
have no motives of its own regarding when it is found; it is simply stationary and hidden 
according to a known distribution (e.g., oil), or its motion is determined stochastically 
by known rules (e.g., a fox in a forest). 

The problems dealt with in this book assume, on the contrary, that the “target” is an 
independent player of equal status to the Searcher, who cares about when he is found. 
We consider two possible motives of the target, and divide the book accordingly. Book 
I considers the zero-sum game that results when the target (here called the Hider) does 
not want to be found. Such problems have been called Search Games (with the “zero- 
sum” qualifier understood). Book II considers the opposite motive of the target, namely, 
that he wants to be found. In this case the Searcher and the Hider can be thought of 
as a team of agents (simply called Player I and Player II) with identical aims, and the 
coordination problem they jointly face is called the Rendezvous Search Problem. This 
division of the book according to Player IPs motives can be summarized by saying that 
in a Search Game the second player (Hider) wishes to maximize the capture time T, 
while in a Rendezvous Problem the second player (Rendezvouser) wishes to minimize 
T. (In both cases, the first player wishes to minimize T.) 

Of the two problems dealt with in the book, the area of Search Games (Book 
I) is the older. These games stem in part from the “The Princess and the Monster” 
games proposed by Rufus Isaacs (1965) in his well known book on Differential Games. 
Beginning with the first search game with mobile hider to be solved (that on the circle, 
by Alpern (1974), Foreman (1974), and Zelinkin (1972)), and the subsequent solutions 
of search games on networks and regions in space by Gal (1979), the early work on such 
games culminated in the classic book of Gal (1980). This work has stimulated much 
subsequent research in the field including applications in computer science, economics, 
and biology. Much of this research is covered in Book I which contains many new results 
on Search Games as well as the classical results, presented with simpler exposition and 
proofs. However there are many open questions, even some of a fairly elementary 
nature, which are also covered here. For an extensive introduction to the area of Search 
Games, see Chapter 1. 
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The Rendezvous Search Problem (Book II) is a more recent area of interest. It asks 
how quickly two (or maybe more) players can meet together at a single location, when 
placed in a known search region, without a common labelling of locations. Although 
posed informally by Alpem as early as 1976, a rigorous formulation for the continuous 
time version did not appear until Alpern (1995). Beginning with the early subsequent 
papers of Alpem and Gal (1995) and Anderson and Essegaier (1995) on rendezvous 
on the line, the interest in this problem has expanded to encompass many variations, 
including multiple player rendezvous and different forms studied by V. Baston, A. Beck, 
S. Fekete, S. Gal, J. V. Howard, W. S. Lim, L. Thomas, and others. Particular interest 
has been paid to some discrete time rendezvous models, which have a separate history 
going back to the original papers of Crawford and Haller ( 1990) on coordination games 
in the economics literature, and Anderson and Weber (1990) in a search theory context. 
Much of this work is surveyed in the paper of Alpern (2002a). An extensive introduction 
to the field of Rendezvous Search can be found in Chapter 10. 

Although both authors have worked in the two fields of Search Games and Ren- 
dezvous Search Theory, the division of this book into two parts reflects the emphasis of 
their work. As such, Book I (Search Games) was mainly written by Shmuel Gal, and 
results in this part which are not otherwise ascribed are due to him. Similarly, Book II 
(Rendezvous Search) was mainly written by Steve Alpern, with unascribed results there 
due to him. Of course both authors take joint responsibility for this book as a whole. 

We would like to put the work of this book into its historical context with respect to 
earlier survey articles and books on search. Search Theory is usually considered to have 
begun with the work of Koopman and his colleagues on “Search and Screening” (1946). 
(An updated edition of his book appeared in 1980.) The problem of finding the optimal 
distribution of effort spent in search is the main subject of the classic work of Stone 
(1989, 2nd ed.), “Theory of Optimal Search”, which was awarded the 1975 Lanchester 
Prize by the Operations Research Society of America. Much of the early work on 
search theory surveyed by Dobbie (1968) was concerned with aspects other than optimal 
search trajectories, and as such is very different from our approach. The later survey 
ofBenkoski, Monticino, and Weisinger (1991) shows how the determination of such 
trajectories has come to be studied more extensively. Recent books on Search Theory 
include those of Ahlswede and Wegener (1987), Haley and Stone (1980), Iida (1992), 
and Chudnovsky and Chudnovsky (1989). The first book to introduce game theoretic 
aspects of search problems was of course Gal (1980), but these are also considered in 
Ruckle (1983a) and form the basis of the recent stimulating book of Garnaev (2000). 
This volume is the first to cover the new field of rendezvous search theory. 
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Lebesgue measure of the search space 
Minimal length of a tour that covers the search space 
Rate of discovery of the searcher 
Cost function (the payoff to the hider) 

Expected cost 

Normalized cost function 

Expected normalized cost 

Distance between Z\ and Z 2 

Diameter of the search space 

Expectation 

A pure hiding strategy 

The set of all pure hiding strategies 

A mixed hiding strategy 

The uniform hiding strategy 

Origin (usually, the starting point of the searcher) 

(A) Probability of an event A 
Search space 

Discovery (or detection) radius 
A pure search strategy (a search trajectory) 

The set of all admissible search trajectories 
A mixed search strategy 
Capture time 
Time parameter 

Value of the hiding strategy (v(h) = inf s c(s, h)) 

Value of the search strategy (u(i) = sup fc c(.y, h)) 

Minimal value obtained by a pure search strategy (the “pure 
value” V = inf 5 u(S)) 

Value of the search game ( v = inf, u(j) = sup^ v(h)) 
Maximal velocity of the hider 
A point in the search space 
Integer part 
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Rendezvous 
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R a ,R s 
G 
t f,g 

H(Q, p, q) 

K,M 

( var s)(r) 
Var(s) 



Expected rendezvous time 

Player-asymmetric, player- symmetric, rendezvous values 

A given group of symmetries of Q 

The maximum rendezvous time for /, g 

The H-network based on Q 

The sets of even and odd nodes of an H-network 

Total variation of function s up to time r 

Total variation of 5 over its domain 
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Book I 

SEARCH GAMES 




Chapter 1 

Introduction to Search Games 



In this book we are mainly concerned with finding an “optimal” search trajectory for 
detecting a target. In the search game part (Book I) we shall usually not assume any 
knowledge about the probability distribution of the target’ s location, using instead a min- 
imax approach. The minimax approach can be interpreted in two ways. One could either 
decide that because of the lack of knowledge about the distribution of the target, the 
searcher would like to assure himself against the worst possible case (this worst-case 
analysis is common in computer science and operations research), or, as in many mili- 
tary situations, the target is a hider who wishes to evade the searcher as long as possible. 
This approach leads us to view the situation as a game between the searcher and the 
hider. In general, we shall consider search games of the following type. The search takes 
place in a set Q called the search space. We shall distinguish between games in compact 
search spaces, which are considered in Part I and games in unbounded domains which 
are considered in Part II. The searcher usually starts moving from a specified point 
O called the origin and is free to choose any continuous trajectory inside Q, subject 
to a maximal velocity constraint. As to the hider, in some of the problems it will be 
assumed that the hider is immobile and can choose only his hiding point, but we shall 
also consider games with a mobile hider who can choose any continuous trajectory 
inside Q. It will always be assumed that neither the searcher nor the hider has any 
knowledge about the movement of the other player until their distance apart is less than 
or equal to the discovery radius r, and at this very moment capture occurs. 

Each search problem will be presented as a two-person zero-sum game. In order 
to treat a game mathematically, one must first present the set of strategies available to 
each of the players. These strategies will be called pure strategies in order to distin- 
guish between them and probabilistic choices among them, which will be called mixed 
strategies. We shall denote the set of pure strategies of the searcher by S and the set 
of pure strategies of the hider by 'H. A pure strategy S 6 S is a continuous trajectory 
inside Q such that S(t) represents the point that is visited by the searcher at time t. 
As to the hider, we have to distinguish between two cases: If the hider is immobile, 
then he can choose only his hiding point H e Q. On the other hand, if he is mobile, 
then his strategy H is a continuous trajectory H(t) so that, for any t > 0, H(t) is the 
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point occupied by the hider at time t. The next step in describing the search game is to 
present a cost function (the game-theoretic payoff to the maximizing hider) c(S, H ), 
where S is a pure search strategy and H is a pure hiding strategy. The cost c(S, H) has 
to represent the loss of the searcher (or the effort spent in searching) if the searcher uses 
strategy S and the hider uses strategy H. Since the game is assumed to be zero-sum, 
c(S, H) also represents the gain of the hider, so that the players have opposite goals: the 
searcher wishes to make the cost as small as possible, while the hider wishes to make it 
large. A natural choice for the cost function is the time spent until the hider is captured 
(the capture time). For the case of a bounded search space Q. this choice presents no 
problems. But if Q is unbounded and if no restrictions are imposed on the hider, then he 
can make the capture time as large as desired by choosing points that are very far from 
the origin. We overcome that difficulty either by imposing a restriction on the expected 
distance of the hiding point from the origin or by normalizing the cost function. The 
details concerning the choice of a cost function for unbounded search spaces are pre- 
sented in Chapter 6. Given the available pure strategies and the cost function c(S , H), 
the value u(5) guaranteed by a pure search strategy S is defined as the maximal cost 
that could be paid by the searcher if he uses the strategy S', thus, 

u(5) = sup c(S, H). (1.1) 

HeW 

We define the minimax value of the game V as 

V = inf u(S). (1.2) 

SeS 

Then for any £ > 0, the searcher can find a pure strategy S, which guarantees that the 
loss will not exceed (1 + £) V. A pure strategy S e that satisfies 

v(S e ) < (1 +e)T 

will be called an e-minimax search trajectory. If there exists a pure strategy S which 
satisfies 

tt(5) = V, (1.3) 

then S will be called a. minimax search trajectory. 

The value V represents the minimal capture time that can be guaranteed by the 
searcher if he uses a fixed trajectory, but in all the interesting search games the searcher 
can do better on the average ifhe uses random choices out of his pure strategies. These 
choices are called mixed strategies (see Appendix A). 

If the players use mixed strategies, then the capture time is a random variable, so that 
each player cannot guarantee a fixed cost but only an expected cost. Obviously, any pure 
strategy can be looked on as a mixed strategy with degenerate probability distribution 
concentrated at that particular pure strategy, so that the pure strategies are included in 
the set of mixed strategies. A mixed strategy of the searcher will be denoted by s and a 
mixed strategy of the hider will be denoted by h. The expected cost of using the mixed 
strategies s and h will be denoted by c(s, h). We will use the notation u(.) to denote 
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the expected cost guaranteed by a player (either the searcher or the hider) if he uses a 
specific strategy. Thus, v(s) is the maximal expected cost of using a search strategy s : 

v(s) = supc(.v, h) = sup c{s, H). (1.4) 

h HeH 

u(i)will be called the value of strategy s. Similarly, the minimal expected cost v(h) of 
using a hiding strategy h 



v(h) — inf c(s, h) = inf c(S, h) (1.5) 

S SeS 

will be called the value of strategy h. It is obvious that for any .v and h, v(s) > v(h), 
because u(.y) > c(s , h) > v(h). 

If there exists a real number v that satisfies 

v = inf v(s) = supu(/i), (1.6) 

* h 

then we say that the game has a value v. In this case, for any e > 0, there exist a search 
strategy s e and a hiding strategy h £ that satisfy 

u(s,:) < (1 +f)i> and v(h g ) > (1 — e)u. (1.7) 

Such strategies will be called f -optimal strategies. In the case that there exists s (resp. h) 
such that u(s) — u (resp. u(/t) = u), then .? (resp. h) is called an optimal strategy. 

The reader will have noticed that to avoid more cumbersome notation we have made 
a rather versatile use of the letter y. Its meaning will depend on the context: without 
an argument, it denotes the value, as defined in (1.6). When its argument is a search 
strategy, it is defined by equation (1.4); when a hiding strategy, by (1.5). 

In general, if the sets of pure strategies of both players are infinite, then the game 
need not have a value (for details, see Luce and Raiffa, 1957, Appendix 7). However, 
in Appendix A we shall show that any search game of the type already described has 
a value and an optimal search strategy. (The hider need not have an optimal strategy 
and in some games he has only £-optimal strategies.) 

Keeping the previous framework in mind, we present a general description of the 
material covered in Book I (Parts 1 and II). This book contains many new results on 
search games, as well as the classical results presented with simpler exposition and 
proofs. It also contains many open problems, even some of elementary nature, which 
hopefully will stimulate further research. 

In Part I we consider search games in compact spaces within the framework pre- 
sented in Chapter 2. In Chapter 3 we analyze search games with an immobile hider in 
networks and in multidimensional regions. Among the topics considered we investi- 
gate the performance of the following natural search strategy: Find a minimal closed 
curve L that covers all the search space Q. ( L is called a Chinese postman tour.) Then, 
encircle L with probability 1/2 for each direction. This random Chinese postman tour 
is indeed an optimal search strategy for Eulerian networks and for trees, or if Q is 
a two-dimensional region. An intriguing problem is to characterize the family of graphs 
for which the optimality property of the random Chinese postman holds. The solution, 
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found recently by Gal (2000), is presented. The difficulties associated with solving 
search games for networks outside of the above family are presented. We also present 
a dynamic programming algorithm for numerically finding an optimal search trajectory 
against a known hiding strategy. This algorithm can sometimes help us to solve search 
games that are difficult to handle analytically. 

Problems with a mobile hider are usually more difficult. (This statement, however, 
is not always true. For example, the solution of the search game on three arcs is easy for 
a mobile hider but very difficult for an immobile hider.) Search games with a mobile 
hider are analyzed in Chapter 4. We first present the solutions for the search game on 
the circle and on k unit arcs connecting two points. We also present several new results 
on networks that can be relatively quickly searched, including the figure eight network. 
The k arcs game can serve as a useful introduction for the Princess and Monster game 
in two (or more) dimensional regions, analyzed later in this chapter. 

In Chapter 5 we consider four types of search games in compact spaces, which do 
not fall into the framework of Chapters 3 and 4. We present in detail new results for 
searching in a maze (i.e., a network with an unknown structure) and “high-low” search 
in which the searcher gains a directional information in each observation. Then we 
survey the problems of searching for an infiltrator who would like to reach a sensitive 
zone and searching in discrete locations. 

In Part II we consider search games in unbounded domains. The general framework 
of such problems is described in Chapter 6. We introduce the normalized cost function 
(called the competitive ratio in Computer Science literature). We show that solving 
a search game using a normalized cost function is usually equivalent to restricting the 
absolute moment of the hiding strategy by an upper bound. 

In Chapter 7 we develop a general tool for obtaining minimax trajectories for 
problems involving homogeneous unimodal functionals. We show that the minimax 
trajectory is a geometric sequence. This enables us to easily find it by minimizing over 
a single parameter (the generator of this sequence) instead of searching over the whole 
trajectory space. The results obtained in Chapter 7 are used in Chapters 8 and 9 but the 
proofs of the theorems are mainly for experts and can be skipped at first reading. 

The linear search problem (LSP), i.e., finding a target with a known distribution 
on the line, has been attracting much attention over several decades. This problem was 
analyzed as a search game by Beck and Newman (1970) and by Gal (1980). In Chapter 8 
we present the above classical results along with several variants. In addition we present 
a new model of the linear search game when changing the direction of motion requires 
some time and cannot be done instantaneously (as originally assumed in the LSP). We 
also present a new dynamic programming algorithm for computing, with any desired 
accuracy, the optimal search trajectory of the LSP for any known hiding distribution. 

In Chapter 9, the last chapter of Book I, we use the tools developed in Chapter 7 to 
solve several search games. At first we find a minimax trajectory for searching a set of 
rays. This problem has recently attracted a considerable attention in computer science 
literature. We then present some new results for the minimax search trajectory on the 
boundary of a region in the plane. Then we analyze the minimax search trajectory for 
a point in the plane. We also discuss several classical and new “swimming in the fog” 
problems in which we have to find a minimax trajectory to reach a shoreline of a known 
shape, starting from an unknown initial point. We then conclude by presenting an open 
problem of searching for a submarine with a known initial location. 




Part One 

Search Games in 
Compact Spaces 




Chapter 2 

General Framework 



The search spaces considered in Part I are closed and bounded subsets of a Euclidean 
space. They are usually either a compact region (i.e., the closure of a connected bounded 
open set) in a Euclidean space with two or more dimensions, or a network. In this book 
a network will mean a finite connected set of arcs, called edges, which can intersect only 
at their endpoints, called nodes. Examples of such networks are a circle, a tree, a set of 
k arcs connecting two points, etc. Obviously, if a graph is given in the combinatorial 
form of nodes and edges, then it can be embedded in a three-dimensional Euclidean 
space R 3 in such a way that the edges intersect only at nodes of the network. (Two 
dimensions are not sufficient for nonplanar networks.) Thus, we shall look upon each 
Network as a subset of/? 3 . Each arc in the Network has a given length and an associated 
distance function defined on it. 

Definition 2.1 The distance d(x, y ) between any two points x and y in a network Q is 
defined as the minimum length among all the paths that connect x and y within Q. The 
diameter D ofQ is defined as the maximum distance between two of its points, that is, 

D = max d(x, y ). 
x.yeQ 

We now describe more specifically a search game A in the space Q, with the outline 
given in Chapter 1. A pure search strategy 5 is a continuous trajectory inside Q that does 
not exceed a fixed maximal velocity. The time unit will be chosen so as to normalize this 
maximal velocity to 1. Such a trajectory S(t) is a continuous mapping S : [0, oo) —*■ Q 
satisfying 

d(S(t\), S(t 2 )) <t 2 — ti for any t 2 > t\> 0. 

We shall usually assume that the searcher has to start from a fixed point O to be called 
the origin (i.e., 5(0) = O ), but we shall sometimes consider other possibilities such 
as a chosen or a random starting point. The set of all pure search strategies is denoted 
by S. A pure hiding strategy H is an arbitrary continuous trajectory inside Q with 
maximal velocity not exceeding a given maximum hider velocity w. In the case w — 0, 
the hider is immobile and H is a single point, while if w > 0, then the hider is mobile 
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and H is a trajectory that satisfies 

d{H(t\), H(J 2 )) < w x (t 2 - /i) for all t 2 > t\> 0. 

The case of a mobile hider also includes the possibility of w = oo, i.e., a hider, 
moving along a continuous trajectory, with an unbounded velocity. The set of all pure 
hiding strategies is denoted by H. 

We assume that the searcher and the hider cannot see one another until their distance 
is less than or equal to the discovery (or detection) radius r and at that very moment 
capture occurs (and the game terminates). In cases where Q is a network, then (for 
convenience) r will be taken as zero. (Actually, r can usually be chosen as a small 
positive number without introducing any significant changes in the results.) If Q is a 
multidimensional region, then it will be assumed that r is very small in comparison 
with the magnitude of Q. (To be more precise, we will assume that yr <3C /Li, where / j, 
and y are, respectively, the Lebesgue measure, of appropriate dimension, of Q and the 
boundary of Q.) In order to simplify the presentation of the results, we shall generally 
consider the case in which both the maximal velocity of the searcher and the radius of 
detection are constants. However, we shall also extend the results to the case where the 
maximal velocity of the searcher depends on his location and the radius of detection 
depends on the location of the hider. We will call such a case an inhomogeneous search 
space. 

Whenever the search space Q is a Network or a subset of Euclidean space, it is 
endowed with Lebesgue measure of the appropriate dimension (corresponding to length, 
area, volume, etc.). To avoid a separate notation for the total measure of Q. we make 
the following simplifying definition. 

Definition 2.2 The Lebesgue measure of any measurable subset B of Q is denoted by 
p{B). The total measure of Q is denoted by p - p (Q). 

The set of points of Q which have been “searched” by a trajectory S by time t is 
denoted by Xs(t). That is, 

X s (t) = [x€ Q:d(S(t'),x) <r for someO <l'< t}. (2.1) 

Obviously, the set that is discovered at time 0 does not depend on S. Its measure 
will be denoted by /xo, i.e., p o = p.(Xs(0)). 

The following notion describes the maximum rate at which new points of Q can be 
discovered. 

Definition 2.3 The maximal discovery rate p of the searcher is defined as, 

p(Xs( 0 ) - mo 

p — sup . 

s,i> o t 

Since the maximal velocity of the searcher is 1 it follows that p = 1 for search in 
a network. In case that Q is a two-dimensional region, the sweep width is 2 r, so that 
the maximal area p of the strip that can be swept in one unit of time is 2 r. By a similar 
reasoning, p is equal to nr~ for three-dimensional regions, and so on. 
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The capture time, which is denoted by c(S, H) (and sometimes by T) represents 
the loss of the searcher (and the gain of the hider). It is formally defined as 

c(S, H) = min{/ : t/(S(f), H(t)) < r}. 

If no such t exists, then we say c(S, H) = oo. 

A mixed strategy s' (resp. h) of the searcher (resp. hider) is a probability measure 
on S (resp. H). In order to rigorously present such strategies, one has to introduce a 
substantial amount of measure-theoretic machinery for S and TL. Such a construction 
is briefly presented in Appendix A. In Gal (1980) full details are presented, including 
the result that c(S, H) is Borel measurable in both variables, so that we can define the 
payoff c, in the case that the searcher uses s and the hider uses h, as the expected value 
of c with respect to the product measure s x h: 

c(j, h) = j C(S, H)d(s X h). (2.2) 

The fundamental results (see Appendix A) are that any search game as described 
above has a value u, i.e., 

inf supc(,s, h) = supinfc(s, h) (2.3) 

•* h h s 

and that the searcher always has an optimal strategy. Thus, for any such search game, 
the searcher can always guarantee an expected payoff not exceeding v, while the hider 
can guarantee that the expected payoff exceeds (1 — e)v. 

For the search games presented in this book, we shall generally use constructive 
methods to find the value and the optimal (or £-optimal) strategies of the players. In 
the case of a network, whenever we can obtain a solution of the game, it will be 
an exact solution. On the other hand, the solutions that we get for the search games 
in multidimensional regions depend on the fact that the detection radius r is small. In 
this case, we shall present two strategies s E and h e and a function fir) which satisfy 
(see (1.7)) 

u&) < (1 + ®)/(r) and v(h e ) > (1 — e) f(r), where s — > 0 as r — > 0. 

Thus, s e and h £ are e-optimal strategies and u ~ fir) for small r. 

In calculating the expected capture time of the search games to be considered, we 
shall often use the following result, which is well known in probability theory (see, e.g., 
Feller, 1971, p. 150). 

Proposition 2.4 The expected value E(T) ofa nonnegative random variable T satisfies 

poo 

E(T) = / Pr(T > t)dt. (2.4) 

Jo 

Since Pr(T > t) is monotonic nonincreasing in t, it follows from (2.4) that for any 
positive number fi 

~ /•<;+ \)P ~ 

£(D=V/ Pr(T > t)dt < pYPr(T > iP). 

/=o ^ to 



(2.5) 
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Similarly, 



OO 00 

E(T) >pY, Pr W > O' + »)J8) = 0 J2 Pr( - T > w- (2-6) 

i= 0 i= I 

We now present a simple but useful result known as the scaling lemma , which will 
enable us to normalize the arc lengths in some networks and will also be used for search 
games in unbounded domains. It actually states that changing the unit length in Q 
affects the search game in a very simple manner. 

Proposition 2.5 (Scaling Lemma) Let A be a search game in a set Q with an origin 
O and a detection radius r. Assume that the value of A is V and that s, h are optimal 
(e-optimal) strategies. Consider a set Q with a metric d, which is obtained from Q by 
an onto mapping <t> : Q — > Q with the following property for some a > 0: 

d(<&(x), <£()>)) = a d(x, y), for all x,y e Q. 

Define a search game A in Q with an origin O = <t>(0), a detection radius r = ar, 
and the same maximal velocitiesfor the searcher and the hideras in A. Then the value 
v of Asatisfies v = otvand the optimal f E-optimal) strategies of A are obtained by 
applying the mapping to the trajectories in Q and changing the time scale by afactor 
of a. 

The proof is based on the simple observation that for any pair of trajectories S and 
H. in Q, the capture time corresponding to 4>(5) and <$(//), in Q, would be multiplied 
by a. A formal proof is given in Gal (1980). 

An identical argument shows that an analogous result holds for the rendezvous 
search problems discussed in Book II. 




Chapter 3 



Search for an Immobile Hider 



3.1 Introduction 

In this chapter, we consider search games in compact spaces with an immobile hider. In 
this case, a pure hiding strategy H is simply a point in the search space Q, and a mixed 
hiding strategy h is a probability measure on Q. A pure search strategy S is a continuous 
trajectory in Q, starting at the origin O, with maximal velocity not exceeding 1. Since 
the hider is immobile, it can be assumed that the searcher will always use his maximal 
velocity because any trajectory that does not use the maximal velocity is dominated by 
a trajectory that uses the maximal velocity along the same path. A mixed search strategy 
s is a probability measure on the set S of these pure strategies. 

A hiding strategy that plays an important role in some of the games to be presented is 
the uniform strategy h tl , which chooses the hiding point in Q “completely randomly.” 1 
More precisely: 

Definition 3.1 The uniform strategy h )L is a random choice of the hiding point H such 
that for all measurable sets B e Q, 

Pr(H € B | 

Recall (see Definition 2.2) that the use of p without an argument means that the 
argument is Q. That is, p, = p(Q). 

Note that it makes more sense for the hider not to hide within distance r from O 
using the uniform distribution on the rest of Q. However, since r is either 0 or very 
small with respect to the magnitude of Q, we will not use this ^-improvement. 

Our next result shows that if the hider chooses his hiding point according to the 
uniform strategy h^, he ensures an expected evasion (capture) time of at least p/2p, 
where p is the searcher’s maximal discovery rate, as introduced earlier in Definition 2.3. 
This result holds not only for search strategies in S (continuous search paths) but even 
for the following larger class of generalized search strategies. 



'Using normalized Lebesgue measure on Q. 
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Definition 3.2 A generalized search strategy is defined by the sets X(t) C Q that it 
has “discovered” by time t. The sets X ( t ) are only required to satisfy the conditions 

X(t')cX(t) fort' <t, and p(X(t) - X(0)) < pt. 

In particular, every continuous strategy S 6 S defines a generalized strategy by the 
formula (2.1). 

(Note that we usually restrict the searcher to move along a continuous trajectory and 
so do not allow generalized search strategies. These strategies will be discussed only in 
this section in order to introduce the unrestricted search game, which will be solved in 
Theorem 3.7.) 

Theorem 3.3 If the hider chooses his hiding point according to the uniform strategy 
h then he ensures an expected capture time of at least p/2p against any generalized 
search strategy and in particular against any trajectory S € <S. 

Proof. Let /xo = p(X(0)) denote the measure of the set of points discovered at 
time 0. (In the case that r = 0, which we shall generally assume for networks, we have 
/xo — 0; for the multidimensional spaces r is very small so that MO <£ p.) In any case 
it follows from the definition of a generalized strategy that 

M(X(0) = P(X(0)) + [//(*(/) - X(0))] < Mo + Pt, 



or simply 



p(X (/)) < pt, in the case r = 0 and Mo = 0. 



Consequently, for r - 0, the probability that a hider hidden according to the distribution 
hf,. has been found by time t is given by 



Pr(J <t)< min 




and hence by (2.4) we have 



max 



i — — , 0 



c(S,h tl ) = E(T) > [ 

Jo 

Jo V M / 2 p 



dt 



(If we do not assume that /xo = 0, the same analysis gives the slightly more complicated 
estimatec(5, h^) > (/x 2 — 2/x/xo + Mo) /2 MP ~ M/2p.) • 

The following result is an immediate consequence of the considerations used in the 
above theorem. 



Corollary 3.4 If S satisfies c(S, h tt ) = p/2 p, then for all 0 < / < p/p the measure 
of the points swept by S in the time interval (0, t\ is equal to pt (i.e., S sweeps without 
overlapping). 
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Since the hiding strategy h M guarantees p/2 p against any starting point of the 
searcher, we also have the following. 

Corollary 3.5 Let A be a search game with value v = p/2p and A' be the search 
game obtained from A by allowing the searcher to choose his starting point. Then the 
value of A 1 is also /i/2 p. 

The extension of the preceding discussion to search games with more than one 
searcher is presented in the following result. 

Corollary 3.6 Consider a search game with one immobile hider and J searchers, with 
the j -th searcher having a maximal velocity Vj. Assume that all the searchers cooperate 
in order to discover the hider (by at least one of them) as soon as possible. Let p j be the 
Lebesgue measure of a set, which can be swept by the j-th searcher in one unit of time, 
and define the total rate of discovery p = Pj. Then Theorem 3.3 holds for this 

game, with p replacing p. 

Proof. LetA(f) denote the set of all points discovered by at least one of the 
J searchers by time t. Then since X ( t ) is easily seen to be a generalized strategy 
with respect to the parameter p. Theorem 3.3 applies to this game as claimed. ■ 

Note that since the uniform strategy h lt is always available to the hider, Theorem 3.3 
shows that /i/2 p is a lower bound for the value of any search game, even if generalized 
search strategies are allowed. In fact, the following result of the authors shows that if 
we allow generalized strategies (and mixtures of them), p/2 p is always the value of 
the resulting “unrestricted game.” Note that for all the search spaces Q, which we will 
consider in this book, the measure space ( Q , //) has the following properties: there are 
no atoms, and any subset of a measure zero set is measurable. Such a measure space is 
called a Lebesgue space. 

Theorem 3.7 The value of the unrestricted search game on any Lebesgue space Q is 
given by p/2p, where p denotes the total measure p(Q). 

Proof. According to the Theorem 3.3, we need only present a generalized search 
strategy that finds any hiding point in expected time not exceeding p/2p. A simple 
construction is to find any generalized search strategy X(t) that sweeps without over- 
lapping during the time interval |0, p/p] and define A(/)as the “reverse” of A ( 1 ) (i.e., 
any point first covered by X at time t is first covered by X at time t — p/p). Then the 
generalized (mixed) strategy that adopts X and X equiprobably, discovers any H € Q 
in expected time p/2p. 

For readers who are familiar with measure theory we present a formal proof of 
the theorem as follows. For any Lebesgue space (Q, //) there exists an invertible bi- 
measurable map cp : [0, p(Q)] —> Q, which takes one-dimensional Lebesgue measure 
p\ into the measure //, (that is, p(cp([a, b])) = b — a, for 0 < a < b < p(Q)) 
(see Halmos, 1950). Define two generalized strategies X and X by X (!) — 0 ( [0, pt\) 
and X(t) = <t>(\p(Q) - pt,p(Q)]), for 0 < t < p(Q)/p. For any hiding point 
H € Q, choose t = 1 ( H) and observe that H e X(t/p) PI X((p(Q)/p) — tjp). 

Consequently, if the searcher adopts X and X equiprobably, then any point H will 
have been discovered in expected time \{t/p + ((p(Q)/p) — t/p)) = p(Q)/2p , as 
claimed. ■ 
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The unrestricted game is similar to a discrete search game in which Q consists of 
n cells of equal size. We now formulate and solve a more general discrete version of the 
search game. In the game to be considered, Q consists of n cells of sizes fi\, . . . , Hn 
and the measure of Q is defined as fx = . It is assumed that the maximal rate of 

discovery of the searcher is p, so that it takes him Hi / p units of time to look at cell 
number i. It is also assumed that if the hider is located in cell i and if the searcher starts 
to look in this cell at time t, then the hider is discovered at time t + Hi /2p. A pure 
hiding strategy H is an element of the set { 1, 2 ,...,«} , while a pure search strategy s 
is a permutation (z'i , . . . , ;„) of the numbers (1, 2,.. n). We now show that the result 
v = n/2p also holds for this discrete version. 

Proposition 3.8 The value of the discrete search game is p/2p. An optimal hiding 
strategy h assigns a probability of Hi/H to each cell, and an optimal search strategy 
S is to choose any permutation (jj, ...,(„) and to assign a probability 1/2 to this 
permutation and a probability 1/2 to its “reverse” (i „, . . . , i\). 

Proof. For any permutation S = (j’j, . . . , z„), the strategy h satisfies 



1 " 

c(S, h) = ~Y 

P U 





M 



1 

HP 



£ 

1 <m< j<n 







i 

2 HP 




M_ 
2 p' 



For all // g {1, . . . , n}, the mixed strategy s satisfies 

C(S, H) = i(C(0,, .... in), H) + C((f„, .... 11), H)) 

_ M_ 

2 p' 

Thus v(h) = u(i) = v = n/2p, I 

A description of some other discrete search games is given in Chapter 5. 

The expression /z/2/0 can be looked upon as the value that is obtained if the searcher 
is able to carry out his search with maximal efficiency. The games considered in this 
book are obviously restricted by the fact that the searcher has to move along a continu- 
ous trajectory so that the value does depend on the structure of Q. We shall have cases, 
such as Eulerian networks (Section 3.2), in which the searcher can perform the search 
with “maximal efficiency” which assures him a value of /z/2p- A similar result holds 
for search in two-dimensional regions with a small detection radius (Section 3.7), and 
in this case the searcher can keep the expected capture time below (1 + e)/z/2p. On 
the other hand, in the case of a non-Eulerian network, we shall prove that the value 
is greater than M/2p and that the maximal value is h/P (Section 3.2). This value is 
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obtained in the case that Q is a tree (Section 3.3). A more general family that contains 
both the Eulerian networks and the trees as subfamilies is the weakly Eulerian net- 
works (see Definition 3.24) for which the optimal search strategy has a simple structure 
similar to that for Eulerian networks and the trees (Section 3.4). We shall also demon- 
strate the complications encountered in finding optimal strategies in the case that Q is 
not weakly Eulerian, even if the network simply consists of three unit arc connecting 
two points (Section 3.5). Dynamic programming is sometimes an effective technique 
for numerically computing an optimal search trajectory against a given hiding strat- 
egy (Section 3.6). For example, this technique can be used to numerically verify the 
optimality of rather complex strategies for the three-arcs search game. 



3.2 Search in a Network 

In our discussion, “network” will mean any finite connected set of arcs that intersect 
only at their end points which we call nodes of Q. Thus, Q can be represented by a 
set in a three-dimensional 2 Euclidean space with nodes consisting of all points of Q 
with degree ^ 2 plus, possibly, a finite number of points with degree 2. (As usual, the 
degree of a node is defined as the number of arcs incident at that node.) Note that we 
allow more than one arc to connect the same pair of nodes. The sum of the lengths of 
the arcs in Q will be denoted by p, and called either the total length or the measure. 

In studying search trajectories in Q, we shall often use the term closed trajectory, 
defined as follows. 

Definition 3.9 A trajectory S(t ) defined for 0 < f < x is called “closed ” ifSiO) = 
S( t). (Note that a closed trajectory may cut itself and may even go through some of the 
arcs more than once.) If a closed trajectory visits all the points of Q, then it is called 
a tour. 

We now consider a family of networks that lend themselves to a simple solution of 
the search game. These are the Eulerian networks defined as follows. 

Definition 3.10 A network Q is called Eulerian if there exists a tour L with length p. 
in which case the tour L will be called an Eulerian tour. A trajectory S(t), 0 < t < p, 
which covers all the points of Q in time p, willbe called an Eulerian path, (Such apath 
need not be closed. ) 

It is well known that Q is Eulerian if and only if the degree of every node is even 
and that it has an Eulerian path starting at O if and only if the only nodes of odd degree 
are O and another node A. In this case every Eulerian path starting at O must end at A 
(see Harary, 1972). 

Since the maximal rate of discovery p in networks is 1, it follows from Theorem 3.3 
that p!2 is a lower bound for the value of the search game in any network. We now 
show that this bound is attained if and only if Q is Eulerian. 

Theorem 3.11 The value of the search game for an immobile hide r on a network Q is 
equal to f. i/2 (half the total length of Q) if and only if Q is Eulerian. 

- Two dimensions are sufficient for planar networks. 
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Proof. First suppose that Q is Eulerian and that L is an Eulerian tour. Define L as the 
reverse path given by L(t ) = L(ji — t) and define the mixed strategy s to pick L and L 
equiprobably. For any hiding point H in Q, there is at least one to < P- with L(to) = H 
and hence L(n — to) = L(/o) = H . Consequently c(§, H) < (l/2)(/o + (p — to)) = 
/x /2. On the other hand, it follows from Theorem 3.3 that if the hider uses the uniform 
strategy , we have c(S, h t f) > p/2 for any pure search strategy S. So if Q is Eulerian, 
the value is half its total length. 

Suppose now that v = p/2. and Q is not Eulerian. By the first assumption, 
Corollary 3.4 (with t = p and p — 1) says that any optimal strategy s must be supported 
by pure strategies S for which 

p[S(x) : 0 < x < p) = p, and hence (S(jc) : 0 < x < yu.} = Q. 

In other words, S must be an Eulerian path (not tour) starting at O. Consequently there 
is a unique node A of Q with odd degree such that every Eulerian path ends at A. We 
will construct a small modification h 1 of the uniform hider distribution h M such that for 
every Eulerian path S we have 

c(S, ti) > c(S, hj = /i/2, (3.1) 



and hence 



c(s, h') > c(i, h^) — n/2, 

which contradicts our optimality assumption for 5. 

To construct h 1 , let a >0 be the minimum length of the arcs incident at A. Define 
the mixed strategy h' by first using and then simply moving any hider H with 
a/2 < d(H, A) < a to the point H' on the same arc with 

d(H', A) = d(H,A)-a/2. 

That is, we move such hiders H a distance a/2 closer to A. For any Eulerian path S 
and for H as above, we have c(S, H') — c(S, H) = 8 x a/2, where 8 — 1 if S traverses 
the arc containing H toward A and 8 = — 1 if this arc is traversed away from A. Since 
any Eulerian path S traverses one more of the arcs incident at A toward A than away 
from A, we have 

c(S, ti) > c(S, hff), 

completing the proof by establishing the required inequality (3.1). ■ 

Corollary 3.12 For an Eulerian network, the optimal strategies and the value of the 
search game remain the same ifwe remove the usual restriction 5(0) = 0 and instead 
allow the searcher to choose his starting point. 

The claim of the corollary is an immediate consequence of Corollary 3.5. 

Remark 3.13 Corollary 3.12 does not hold for non-Eulerian networks because (unlike 
the Eulerian case) the optimal hiding strategy usually depends on the starting point 
of the searcher. In general, ifwe allow the searcher to choose an arbitrary starting 
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point, then the value of this game is p/2 if and only if there exists an Eulerian path 
(not necessarily closed) in Q. (If there exists such a path, L, then the searcher can 
keep the expected capture time < p/2 by an analogous strategy to s of Theorem 3.11, 
choosing the starting point randomly among the two end nodes ofL. If there exists no 
Eulerian path in Q, then the hider can keep the expected capture time above p/2 by 
using h nf 

We now establish an upper bound for the value, which holds for all networks. 

Definition 3.14 A closed trajectory that visits all the points of Q and has minimal 
length will be called a minimal tour (or a Chinese postman tour ) and is usually denoted 
by L. Its length will be denoted by p. 

Lemma 3.15 Any minimal tour satisfies p < 2 p. Equality holds only for trees. 

Proof. Consider a network Q 2 obtained from Q as follows. To any arc b in Q, 
add another arc h that connects the same nodes and has the same length as b. Since 
every node of Q 2 has even degree, it follows that Q 2 has an Eulerian tour L 2 of length 
p(Q 2 ) = 2p(Q) — 2p. If we now map the network Qi into the original network Q 
such that both arcs b and b of (9 2 are mapped into the single arc b of Q, then the tour 
L 2 is mapped into a tour L of Q with the same length 2 p. If Q is not a tree, it contains 
a circuit C. If we remove all new arcs b in Q 2 corresponding to this circuit, then the 
resulting network is still Eulerian and contains Q but has total length less than 2 p. ■ 

Finding a minimal tour for a given network is called the Chinese postman problem. 
This problem can be reformulated for any given network Q as follows. Find a set 
of arcs, of minimum total length, such that when these arcs are duplicated (traversed 
twice in the tour), the degree of each node becomes even. This problem was solved by 
Edmonds (1965) and Edmonds and Johnson (1973) using a matching algorithm that 
uses 0(n 3 ) computational steps, where n is the number of nodes in Q. This algorithm 
can be described as follows. First compute the shortest paths between all pairs of odd- 
degree nodes of Q. Then, since the number of odd degree nodes is even, partition them 
into pairs so that the sum of lengths of the shortest paths joining the pairs is minimal. 
This can be done by solving a weighted matching problem. The arcs of Q in the paths 
identified with arcs of the matching are the arcs that should be duplicated (i.e., traversed 
twice). The algorithm is also described by Christofides (1975) and Lawler (1976). (An 
updated survey on the Chinese postman problem is presented by Eiselt et al., 1995.) 

Once an Eulerian network is given, one can use the following simple algorithm for 
finding an Eulerian tour (see Berge, 1973). Begin at any node A and take any arc not 
yet used as long as removing this arc from the set of unused arcs does not disconnect 
the network consisting of the unused arcs and incident nodes to them. Some algorithms, 
which are more efficient than this simple algorithm were presented by Edmonds and 
Johnson (1973). Actually, it is possible to slightly modify Edmond’s algorithm in order 
to obtain a trajectory (not necessarily closed), which visits all the points of Q and has 
minimal length. This trajectory is a minimax search trajectory, and its length is the 
minimax value of the game. 

Example 3.16 Consider the graph in Figure 3.1 (having the same structure as 
Euler’s Koninsberg bridge problem ) in which all the four nodes have odd degrees. 
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Figure 3.1. 



The duplicated arcs in the minimal tour can be either {fli , b\ , c \ } (based on the par- 
tition { AB , OC }) or[b\,d] (based on {OB, AC}) or [a\,e] (based on {OA, BC}). 
The corresponding sum of the lengths of the arcs is 5 or 4 or 5. Thus, the minimal 
tour duplicates the arcs b\ and d. The minimal tour can be traversed by the following 
trajectory Si 

Si = 0b\Bb\0biBeCdAa\0a2AdCc\0 (3.2) 

with length ft = 20. A minimax search trajectory is based on duplicating arcs (having 
minimal total length ) in order to make all the node degrees even exceptfor the starting 
point O plus another node. It can be easily seen that the arcs that have to be duplicated 
are a\ and b\ leading to the following minimax trajectory 

Ob[ Bb\ ObjBeCdAai Oa\AaiOc\C 



with length 18. 

Using the length ft of the minimal tour, we now derive an upper bound for the value 
of the search game, with immobile hider, in a network in terms of the length of its 
minimal tour. 



Definition 3.17 The search strategy s that encircles L equiprobably in each direction, 
will be called the random Chinese postman tour. 



Lemma 3.18 For any network Q, the random Chinese postman tour, s , finds any point 
FI in expected time not exceeding /i/2. Consequently v < /i/2. 



Proof. For any hiding point H, if a path of.? reaches H at time t, then the opposite 
path reaches it not later than ft — l . Consequently, s finds H in an expected time not 
exceeding 



1 1 _ /U 

5 W + 5 0i-0<f. 



(Note that such a search strategy was shown to be optimal for Eulerian networks, in 
the beginning of Theorem 3.11. However, this strategy need not be optimal for other 
networks.) ■ 
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A random Chinese postman tour of Figure 3.1 is to equiprobably follow Si (see (3.2)) 
or the same path in the opposite direction: 

0c\CdAci20a\AdCeBb 2 0biBb\0. 

Combining Theorems 3.3 and 3.11 and Lemmas 3.15 and 3.18 we obtain the fol- 
lowing result. (The last statement of the theorem will be proven in the next section in 
Theorem 3.21.) 

Theorem 3.19 For any network Q, the value v of the search game with an immobile 
hider satisfies 

p/2 <v< jx/ 2 < p. 

The lower bound is attained if and only ifQ is Eulerian. The upper bound p is attained 
if and only if Q is a tree. 



3.3 Search on a Tree 

We now consider the search game on a tree. Our main findings (Theorem 3.21 ) are that 
the value of such a game is simply the total length of the tree (i> = p) and that a random 
Chinese postman tour is optimal for the searcher. The optimal strategy for the hider is to 
pick among the terminal nodes according to a certain recursively generated probability 
distribution, which will be explicitly described. 

The fact that v < p, is an immediate consequence of Theorem 3.19. The reverse 
inequality is more difficult to establish. First observe that if x is any point of the tree 
other than a terminal node, the subtree Q x (the connected component, or components, 
of Q - {*}, which doesn’t contain the starting point O) contains a terminal node y. 
Since no trajectory can reach y before x, hiding at y strictly dominates hiding at x. So 
we may restrict our hiding strategies to those concentrated on terminal nodes. 

To motivate the optimal hiding distribution over the terminal nodes, we first consider 
a very simple example. Suppose that Q is the union of two trees Q i and Qj that meet 
only at the starting node O. Let /x, denote the total length of Q, . Let /?,■ denote the 
probability that the hider is in the subtree Q\. Assume that the searcher adopts the 
strategy of first using a random Chinese postman tour of Q i and then at time 2p \ starts 
again from 0 to use a random Chinese postman tour of Q 2 . The expected capture time 
T resulting from such a pair of strategies can be obtained as in the proof of Lemma 3.18, 
giving 

MlMl + R2(2Ml + M2) = Ml + M2 (Ml + M2). 

Conducting the search in the opposite order gives an expected capture time of 

M2 + Ml (Ml + M2)- 

Consequently, if the p ,■ are known, the searcher can ensure an expected capture time of 

min[/li +M 2 (Ml + M 2 ). M 2 + Ml (Ml + M 2 )]- 
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Since the two expressions in the bracket sum to 2(/x j + ji^), it follows that the hider 
can ensure an expected capture time of at least Ml + M2 only if these expressions are 
equal, or 

Ml M2 

Ml = 7 , M 2 = • 

Ml + M2 Ml + M2 

This analysis shows that if i» = /./, then an optimal hider strategy must hide in each 
subtree with a probability proportional to its total length. 

In general, an optimal hiding strategy will be constructed recursively by the 
following algorithm. 

Algorithm for hiding in a tree 

First recall our above argument that the hiding probabilities are positive only for the 
terminal nodes of Q. We start from the origin O with P(Q) = 1 and go toward the 
leaves. In any branching we split the probability of the current subtree proportionally 
to the measures of subtrees corresponding to the branches. When only one arc remains 
in the current subtree we assign the remaining probability, p(A), to the terminal node 
A at the end of this arc. We illustrate this method for the tree depicted in Figure 3.2. 

From O we branch into A i , C, 0\ , and O 2 with proportions 1,3,6, and 3, respec- 
tively. Thus, the probabilities of the corresponding subtrees are yy , yy , and yy , 
respectively. Since Ai and C are leaves we obtain p{A\ ) = yy and p(C) = yy . 
Continuing toward 0\ we split the probability of the corresponding subtree, yy, with 
proportions 5, 5, and | between B\ , i?2,and Ci so that 

M( fi i) = h' M( fi 2) = |i and p(C\) = 55. 

Similarly, 

p(A 2 ) = n x 5 = ^ and m0 *3) = n * 2 = h- 

In order to show that v = p. we shall demonstrate that the above described hiding 
strategy h is optimal for trees, i.e., guarantees an expected capture time of at least //. 
This proof begins with the following result. 

Lemma 3.20 Consider the two trees Q and Q' as depicted in Figure 3.3. The only 
difference between Q and Q' is that two adjacent terminal branches BA\ of length a 1 
and BA 2 of length a 2 (in Q) are replaced by a single terminal branch BA' of length 




Figure 3.2. 
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Figure 3.3. 



a\ + ai (in Q'). Lei v be the value of the search game in Q and let v' be the value of 
the search game in Qf . Then v > v' . 

Proof. Let h' be an optimal hiding strategy for the tree Q\ so that 

c(S',h')>v', for any pure trajectory S'. (3.3) 

We may assume, as explained above, that h' is concentrated on terminal nodes. Given h\ 
we construct a hiding strategy in h in the network Q as follows. For any node other than 
A] or ^2, the hiding probability is the same for h and h' . The probabilities p\ = h(A \ ) 
and p 2 = h(Af) of choosing A i and A 2 when using h are given by the formulae 

^ t / . a 2 f . . 

P 1 = : P and p 2 = p, (3.4) 

a 1 +02 +ai 

where p' = h'(A') is the probability of A' being chosen by h\ and a\ , 02 are the lengths 
defined in the statement of the lemma (see Figure 3.3). We shall show that v > v' by 
proving that for any search trajectory S in Q we have 

c(S, h) > v' . (3.5) 

In order to prove (3.5) we proceed as follows. Since the hider uses the strategy h that 
chooses its hiding point at terminal nodes only, it is best for the searcher to use a search 
trajectory, which has the following characteristics. Starting from the root O, it moves 
via the shortest route to some terminal node, then moves via the shortest route to another 
terminal node, and so on until all the terminal nodes have been visited. More precisely, 
we can say that any search trajectory is dominated by one that visits the terminal nodes in 
the same order (of first visits) and has the above “shortest route” property. Consequently, 
there is a one-to-one correspondence (denoted by ~) between the set of undominated 
search trajectories and the permutations of the terminal nodes. Bearing that in mind and 
assuming (without loss of generality) that the search strategy S visits the terminal node 
A 1 before visiting A 2 , S can be represented by the following permutation of terminal 
nodes: 



S ~ M/ P . . . , Ai,, Ai, Ajp . Ajj, A 2 , Ai { A/ l ). 
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Let dk — min(? : S(t) = A*} denote the time taken for the trajectory S to reach the 
node Ak, and let pk = h(Ak) denote the probability with which the mixed strategy h 
chooses Ak- With this notation, the required inequality (3.5) is equivalent to 

/ J L 

J2 di '» Pi* + d \P\ + Yl d jm Pj m + d 2P2 + d, m Pl m > V. ( 3 . 6 ) 

m = I m=l ra= 1 

In order to prove (3.6), we will consider two search trajectories in Q'\ 

5", ~ (A,-, A il ,A',A ji ,...,Aj J ,Ai l ,...,Ai L ) and 

&2 ~ (■'^h > • • • > ^i / > Aj t , . . . , Ajj , A , A/i , . . . , Ai l ). 

It follows from (3.3) that 

c(S[,h')>v' and (3.7) 

c(S’ 2 , h') > i/. (3.8) 

Note that the nodes of Q’ are A ' , A 3 , A 4 , .... For i = 1,2 and k > 3, let dik denote 
the time taken for the trajectory S' to reach the node Ak, and let df denote the time for 
Sj to reach A'. It is easy to see that the following relations hold: 



d\ im = d 2 i m = d im 

d \j,n = djm + la 2\ d 2 j m < d jm - 2a 1 

du m <d/ m ; d 2 i m < d lm 
d[=d\+a 2 \ d 2 <d 2 -a\. 



It follows from (3.7) and (3.8) that 
a\ 



a\ +a 2 



c(S[,h') + 



ai 

a t + a 2 



c(S 2 , h’) > 1 / 



Consequently, 



( 3 . 9 ) 



a l 

a\ +a 2 



+ 



Pirn + d\ P + d\j m Pj m + dll "‘ P,m J 
\ m=l m = l m= 1 / 

( J2 d 2i m Pim + J2 dljmPjm + ^ ^ Pl " ^ ~ 

\ m=\ m = 1 m — 1 / 



a i +a 2 



Using (3.9), we obtain 



E + £ djnPj- + I] + + 

w=l m=l m= 1 



tfl + «2 



/r'^2 > n', 



( 3 . 10 ) 
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and now the required inequality (3.6) immediately follows from (3.4) and (3.10), 
completing the proof. ■ 

Using Lemma 3.20, the next theorem uses induction on the number of terminal 
nodes in order to show that the hiding strategy h indeed guarantees an expected capture 
time > p for any tree. 

Theorem 3.21 Let Q be a tree with total length p. Then 

(i) The optimal search strategy is the random Chinese postman tour. 

(ii) An optimal hiding strategy can be constructed recursively using the Algorithm 
for Hiding in a Tree. 

(iii) V = p. 

IfQ is a network which is not a tree, then v < p. 

Proof. First we show that v-p. We know from Theorem 3.19 that V < p, so if 
the theorem is false there is some tree Q with a minimal number of nodes, for which 

V < p. Clearly, Q cannot consist of a single arc O A, since in that case the hider ensures 
a capture time of at least p, by hiding at A. In all other cases, we can apply Lemma 3.20 
to Q to obtain a tree Q‘ with fewer nodes and a search value v' satisfying v' < v and 
hence also v' < p. This contradicts the assumed minimality of the counter-example Q 
and thus proves that for any tree v= p. 

To prove that V = p, only for trees, note that by Theorem 3.19 v < p/2 < p. Thus, 

V = p implies p — 2 p, which implies, by Lemma 3.15, that Q is a tree. ■ 

3.4 When is the Random Chinese Postman Tour 
Optimal? 

In the case that the network Q is neither Eulerian nor a tree, it folio ws from Theorem 3.19 
that pi 2 < v < p. Yet it may happen, for some networks, that the random Chinese 
postman tour is an optimal search strategy (as in the cases of Eulerian networks and 
trees). In this section we analyze such networks. In Section 3.4.1 we present, as a starter, 
a family of networks for which the random Chinese postman tour is optimal, and in 
Section 3.4.2 we present the widest family of networks with this property. 

3.4.1 Searching weakly cyclic networks 

Definition 3.22 A network is called weakly cyclic if between any two points there are 
at most two disjoint paths. 

An equivalent requirement, presented in [205], is that the network has no subset 
topologically homeomorphic with a network consisting of three arcs joining two points. 

The difficulty in solving search games for the three-arcs network is illustrated in 
Section 3.5. Note that an Eulerian network may be weakly cyclic (e.g., if all the nodes 
have degree 2) but need not be weakly cyclic (e.g., 4 arcs connecting two points). 

It follows from the definition that if a weakly cyclic network has a closed curve T 
with arcs b\, ... ,bi incident to it, then removing T disconnects Q into k disjoint 
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networks Q\ Qk with bj belonging to Q,. (If Q, and Q j were connected then the 

incidence points of P with b< and with bj could be connected by three disjoint paths.) 
Thus, any weakly cyclic network can be constructed by starting with a tree (which is 
obviously weakly cyclic) and replacing some of its nodes by closed (simple) curves as, 
for example, T and P 1 in Figure 3.4 (all edges have length 1). 

We now formally define the above operation: 

Definition 3.23 Let V be a connected subnetwork of a network Q. If we replace the 
network Q by a network Q' in which T is replaced by a point B and all arcs in Q — F 
that are incident to 1' are incident to B in Q' , we shall say that T is shrunk and B is 
the shrinking node. 

It is easy to see that if Q contains a set of disjoint closed (simple) curves P| , . . . , T* 
such that shrinking them transforms Q into a tree, then Q is weakly cyclic. 

In order to obtain a feeling about optimal solutions for such networks, we consider 
a simple example in which Q is a union of an interval of length / and a circle of 
circumference p - l with only one point of intersection as depicted in Figure 3.5. 
Assume, for the moment, that the searcher’s starting point, O, is at the intersection. 

Note that the length of the Chinese postman tour is p = p + l. (Remember that 
the trajectory has to return to 0 .) We now show that the value of the game satisfies 
v — (p -f 1)/2 — p/2 and the optimal search strategy, 5 , is the random Chinese 
postman tour. 

The random Chinese postman tour guarantees capture time of at most p/2 by 
Lemma 3.18. The following hiding strategy, h, guarantees (at least) p/2: hide with 
probability p = 21 /(p + /) at the end of the interval (at A) and with probability 1 — p 
uniformly on the circle. It can be easily checked that if the searcher either goes to A, 
returns to O, and then goes around the circle or encircles and later goes to A, then the 
expected capture time is equal to (p + 1)12. Also, any other search trajectory yields a 
larger expected capture time. 
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Now assume that the starting point is different from 0. In this case the value and 
the optimal search strategy remain the same, but the optimal hiding strategy remains h 
only if the starting point is anywhere on the circle. As the starting point moves from 0 
to A, the probability of hiding at A decreases from 21 /(jj, + I) to 0. 

The solution to search games on weakly cyclic networks was presented by Reijnierse 
and Potters (1993). They showed that v = fl/2 and presented an algorithm for con- 
structing optimal hiding strategies. (The optimal search strategy is the random Chinese 
postman tour.) 

We now present a simpler version of Reijnierse and Potters’ algorithm. In our 
construction we transform the network Q into an “equivalent” tree Q as follows: Shrink 
each closed curve F/, with circumference y,-, and replace it by an arc c,- of length y,- /2 
that connects a new leaf (terminal node) C, to the shrinking node ZJ, . All other arcs and 
nodes remain the same. Let h be the optimal hiding strategy for the tree Q. Then the 
optimal hiding strategy for Q is obtained as follows: 

• For a leaf of Q (which is also a leaf of Q) hide with the probability assigned to it 
by h. 

• For a curve r, (represented by leaf C, in Q) hide uniformly along it with overall 
probability assigned by h to leaf C, . 

• No other arcs and nodes are ever chosen as hiding places. 

We now use the above construction for the network Q depicted in Figure 3.4, in the 
beginning of the subsection. The equivalent tree is depicted in Figure 3.2 (Section 3.3). 

Note that the curves T and T i are replaced by arcs 0 C and 0\C\. Thus, the optimal 
hiding probability is the same for the leaves of Q and (replacing the leaves C and C | 
by F and T i) hiding, uniformly, on F with probability -p^ (i.e., probability density 
and on F] with probability ^ (i.e., probability density ^). 

3.4.2 Searching weakly Eulerian networks 

Reijnierse and Potters (1993) conjectured that their algorithm for constructing the opti- 
mal hiding strategy for the weakly cyclic network as well as the result v = /i/2 hold for 
the wider family of weakly Eulerian networks, i.e., networks obtained from a tree by 
replacing some nodes with Eulerian networks. This conjecture was shown to be correct 
by Reijnierse (1995). They also conjectured that v = jx/2 implies that the network is 
weakly Eulerian. 

Gal (2000) provided a simple proof for the first conjecture and also showed that their 
second conjecture is correct. In order to present these results we first formally define 
the networks in question. 

Definition 3.24 A network is called weakly Eulerian if it contains a set of disjoint 
Eulerian networks F| F* such that shrinking them transforms Q into a tree. 

An equivalent definition is that removing all the (open) arcs that disconnect the 
network (the “tree part”) leaves a subnetwork(s) with all nodes having an even (possibly 
zero) degree. (Note that in particular removing an arc leading to a terminal node leaves 
the end node.) Obviously, any Eulerian network is also weakly-Eulerian. A weakly- 
Eulerian network has the structure illustrated in Figure 3.6. 
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Figure 3.6. 

Theorem 3.25 Let Q be a weakly Eulerian network and let A denote the length of the 
Chinese postman tour of Q. Then 

1. v = A/2. 

2. A random Chinese postman tour is an optimal search strategy. 

3. An optimal hiding strategy for Q is obtained as follows: First construct a tree 
Q by shrinking all the Eulerian subnetworks ofQ, and adding at each shrinking 
node a new leaf with half the length of the associated Eulerian subnetwork. Hide 
in every terminal node of Q according to the optimal probability for hiding in 
that node of the tree Q. Hide uniformly on each Eulerian subnetwork ofQ with 
a total probability equal to the optimal probability of hiding at the end of the 
associated “new leaf” ofQ. 

Proof. We already know from Theorem 3.19 that the value v of the search game on 
Q satisfies u < A/2. We will show that the stated hiding strategy ensures an expected 
capture time > /i/2, so that claims 1 and 3 will be established. Then claim 2 will then 
follow from Lemma 3.18. 

We first show that the length of a Chinese postman path in Q is equal to fi, the 
same as that for the original network Q. To see this, partition Q into the Eulerian parts 
E (union of the Eulerian subnetworks) and the remaining treelike part T. Since £ is a 
union of networks, we have 



H = H(Q)=n(E) + 2n(T). (3.11) 

If we denote the union of the “new leaves” of Q by E' and recall that 2 (jl(E') = 
we calculate the length jx{Q) of the Chinese postman tour of Q = E' U T as 

jHQ)=2gL(E') + 2n(T) 

= M(£)+2/i(7’) 

= A«2) = A. by (3.11). 

Since Q is a tree, this implies that its total length is A/2. Consequently by 
Theorem 3.21, there is an optimal hiding strategy h * on g, which guarantees an expected 
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capture time of at least /i/2. We will show how to adapt this to Q without changing 
this time. We now assume that lengths of all the arcs of the Eulerian subnetworks are 
rational. (We can do this because the value is a continuous function of the arc lengths.) 
Thus we can find an arbitrarily small e > 0 such that these lengths are even integer 
multiples of £. On such an arc of length 2 ks, add k additional nodes of degree two 
(called grid points), equally spaced at distances e,3e, ... , (2k — l)e from an end node. 
Do this for all the arcs in the Eulerian subnetworks. The optimal hiding strategy h for 
Q will be concentrated on the terminal nodes of Q and on these new grid points. 

Before defining h, we introduce a new tree Q as follows: Shrink all the Eulerian 
subnetworks, and at each shrinking node B add leaves of length e whose total length 
equals half the length of the corresponding Eulerian subnetwork. Identify the grid points 
of the Eulerian subnetwork with the terminal nodes of these leaves. The tree Q is the 
same as the tree Q except that instead of a single “new leaf’ of length say y/2 for each 
shrunk Eulerian network of length y, there are y / 2 b leaves of length £. Observe that the 
optimal hiding strategy for a tree will put the same total probability (spread out equally) 
at the ends of the y/2e small leaves of Q, as on the single end of the corresponding 
larger leaf in Q. 

An optimal hiding strategy h for Q can now be induced on Q from the optimal 
hiding strategy h on Q, with the probability of each grid point of Q equal to that of 
the corresponding end of a small leaf. Suppose now that some search strategy S on Q 
visits the grid and terminal nodes of Q in such a way that the expected time (relative 
to h) to reach such a node is less than /i/2. Let S be the search strategy on Q that 
visits its corresponding terminal nodes in the same order, moving between consecutive 
nodes in least time. Note that our construction of Q ensures that the distance between 
any pair of grid or terminal nodes in Q is at least as large as the distance between their 
corresponding terminal nodes in Q. This ensures that the search path S will not get 
to any terminal node of Q later than S gets to its corresponding node in Q. Hence 
the expected time (relative to h) for S to reach the hiding point is also less than fx/2, 
contradicting the optimality of h (and the value v = jx/2 for Q ). Consequently no 
search strategy S on Q finds an object hidden according to h is expected time less than 
jx/2, establishing our claim. ■ 

The strategy h given here is not uniquely optimal. For example, the simpler strategy 
stated in the theorem, of hiding uniformly in each Eulerian subnetwork, is also optimal. 
(Or that of hiding in the middle of each new arc.) 

We now illustrate the result by an example: Let Q be the union of an Eulerian 
network y, of measure y, and two arcs of lengths 1 and 2, respectively, leading to leafs 
Ci and C 2 (see Figure 3.7). 

If O 6 P then, Q would be a star with three rays of lengths 1, 2, and 0.5 y, 
respectively. Thus h hides at Ci with probability 1 /(0.5y -f- 3) at C 2 with probability 
2/(0.5y + 3) and uniformly on T with overall probability 0.5y/(0.5y + 3). If the 




Figure 3.7. 
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starting point is on the arc leading to Ci with distance 1 from C 2 , then Q would be 
the tree depicted in Figure 3.8. Thus, the corresponding optimal hiding probabilities 
forC|, C 2 , and V would be (0.5y + 2) / (0.5 y 4- 3) x 1/(0. 5y 4- 1), 1/(0. 5y + 3), and 
(0.5y 4- 2)/(0.5y + 3) x 0.5y/(0.5y + 1), respectively. 

We now prove the second conjecture of Reijnierse and Potters (1993), i.e., that all 
networks that are not weakly Eulerian have value strictly smaller than /i/2. 

Theorem 3.26 For any network Q. ifv = jx /2, then Q is weakly Eulerian. 

Proof. Let L be a Chinese postman tour of Q (with length jx). Let Qj, i = 1, 2, be 
the subnetworks of Q determined by the arcs traversed i times by L. Note that Q \ is 
the union of Eulerian networks: all its nodes have even degree. Let Q' be the network 
consisting of Q 1 and a version of Qi in which all its arcs are doubled. The resulting 
network Q' is Eulerian and has total length jx. 

We shall show that under the hypothesis of the theorem, the removal from Q of (the 
interior of) any arc b of Q 2 results in a disconnected network Q-b. Consequently, the 
removal of all such arcs leaves the network Q — Q 2 ~ Q\, which has all even degree 
nodes. Thus the original network Q will satisfy the equivalent definition given above 
for a weakly Eulerian network. 

Let b be any arc of the subnetwork Q 2 with length 1(b) > 0, and let h E be a mixed 
hider strategy for Q guaranteeing a capture time of at least /i/2 — e. We will establish 
that Q-b is not connected by showing that the alternative assumption of connectedness 
lead to a contradiction of the E-optimality of h f . 

Since Q-b is assumed to be connected, so is the network Q" = Q' — b — b', where 
b' is the added arc parallel to b. Since Q" is Eulerian and has total length jx — 2 1(b), its 
Eulerian tour produces a Chinese postman tour L' of Q-b with length jx — 21(b). The 
random Chinese postman tour s' based on L ' finds all points of Q-b in expected time 
not exceeding jx/2 — l(b). Denote the endpoints of b by A and C and its midpoint by B. 
Since b cannot be a leaf of Q (because its terminal node would become disconnected 
from the rest of Q), both of its endpoints A and C are visited by lJ . Consequently, we 
may extend to a Chinese postman tour of Q in two ways L A and L«. The tour La 
traverses b in both directions after its first arrival at A , the tour Lq does the same when 
it reaches C. The search strategies sa and sc based on random Chinese postman tours 
of La and Lc each guarantee an expected capture time no more than /i/2 for all hiding 
points H in Q. However, if H belongs to the half arc [A , B ], then it will be found by 
sa with a smaller expected time, namely 



c(s A , H) = c(s\ A) + d(A, H) < (jx/ 2 - 1(b)) + 1(b )/ 2 = jx/2 -1(b)/ 2. 
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It follows that the probability/? = h e ([A, Z?])of hiding in [A, 5] under the e-optimal 
strategy h E satisfies 



M/2 — £ < c(s A ,h e ) < 





+ 0 ~P) 




a - pi(b) 

~Y~ 



and consequently 



P < 



2s 

Wy 



Using sc instead of s A we can show that the same estimate holds for the probability 
of hiding in [B, C]. Consequently, the probability of hiding in b when using h E is 
less than 4 s/ 1(b). Now consider the search strategy s' A on Q, which first follows s', 
goes in the shortest route to A, and then traverses the arc b. For any H e Q — b, we 
have c(s' A , H) = c(s' , H) < /i/2 — 1(b), and for any H € b we have c(s' A , H) < 
p + d(0, /!) + d(A, C) < 2 /i. Hence for sufficiently small e, we have 






P 

< — 

2 



m 

2 ’ 



which contradicts the e-optimality of li c for e < 1(b)/ 2. ■ 

Combining Theorems 3.25 and 3.26 we have the following summarizing result: 
Summary 3.27 lfQ is weakly Eulerian, then v = /i/2; otherwise, v < /i/2. 

An equivalent statement is that random Chinese postman tour is an optimal search 
strategy if and only if the network Q is weakly Eulerian. 

Remark 3.28 Note that the value is /i/2, and the optimal search strategy is a random 
Chinese postman tour, independently of the specific starting point. If Q is not weakly 
Eulerian, then the value may depend on the starting point O. (For example, in the 
next section we show that the value does depend on O in the three-arcs network, see 
Remark 3.33.) 

We conjecture that the independence of the value on the starting point holds only 
for weakly Eulerian networks. 

Searching a network which is not weakly Eulerian is expected to lead to rather 
complicated optimal search strategies. We shall show in the next section that even the 
“simple” network with two nodes connected by three unit length arcs, requires a mixture 
of infinitely many trajectories for the optimal search strategy. 



3.5 Simple Networks Requiring Complicated Strategies 

In the previous sections we solved the search game for weakly Eulerian networks. 
(Remember that this family includes Eulerian networks and the trees as “extreme” 
cases.) We have shown that the random Chinese postman tour is optimal, and v = p/2, 
only for this family. We now present a simple network, which is not weakly Eulerian 
and hence has value strictly less than /i/2. 
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In this example, the network Q consists of A: distinct equal length arcs, b \, . . . , bk, 
connecting two points O and A. An immediate consequence of the scaling lemma 
presented in Chapter 2 is that it is sufficient to consider the case in which all the arcs 
have unit length. This example will also be considered in the next chapter, where we 
deal with a mobile hider. If the number k of arcs is even, then the network Q is Eulerian, 
and the solution of the game is simple. On the other hand, it turns out that the solution 
is surprisingly complicated in the case that k is an odd number greater than 1, even if k 
is equal only to 3. For this network /t — k + 1, so we know from the last section that 
v < (k + l)/2. We will prove the stricter inequality: 

Lemma 3.29 If Q is a set ofk non-intersecting arcs of unit length, which join O and 
A, and k is an odd number greater than 1, then 



k 1 

u < — + — . 
2 2 k 



(3.12) 



Proof. We will use a natural search strategy, s, based on the following definition: 



Definition 3.30 For a given set consisting ofk elements, choosing each element with 
probability 1 /k will be called an equiprobable choice. 

We now proceed in establishing (3.12) by presenting the following search strategy s. 
Starting from O make an equiprobable choice among the k arcs, and move along the 
chosen arc to A. Then make an equiprobable choice among the k - 1 remaining arcs, 
independently of the previous choice, and move along this arc back to O. Then move 
back to A and so on until all the arcs have been visited. Since k is odd, any such path 
ends at A. 

Let H be any pure strategy (i.e., a point in (Q) and assume that its distance from A 
is d, so that its distance from O is 1 - d. Let m = (k - l)/2, let i = 1, . . . , m, be 
the event that the hider is discovered during the time period (2(i - 1), 2/J, and let E / 
be the event that the hider is discovered during the time period (k - 1, k). Then 

m 

c(s, H) = £(2 i - 1 )Pr(Ei) -f (k - d)Pr(Ef) 

i=i 

m 2 1 

_ k J_ _ </ k 
~ 2 + 2k~ ~k < 2 + 2k' 



The case where Q consists of an odd number of arcs that connect two points has 
been used as an example for situations in which v < /i/2, but this case is interesting by 
itself. It is amazing that the solution of the game is simple for any even k (and also, as 
will be demonstrated in the next chapter, for any odd or even k if the hider is mobile), but 
it is quite complicated to solve this game even for the case that k is equal only to 3. The 
reasonable symmetric search strategy s, used in proving Lemma 3.29, which is optimal 
for an even k, can assure the searcher an expected search time less than k! 2 + 1/2 k. 
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We shall immediately show that ,5 is not an optimal strategy for an odd number of arcs. 
(Incidentally, the fact that the search strategy s is not optimal or even £-optimal, can be 
easily deduced from the following argument. If the searcher uses then the hider can 
guarantee a payoff that is close to A/ 2 + 1/2A only by hiding near A with probability 
1 — £. However, it can easily be verified that the payoff guaranteed by such a hiding 
strategy does not exceed 1 + 8, where 8 is small. Thus, the value of the game has to be 
smaller than A/2 + 1/2A, which implies that the strategy s cannot be optimal or even 
£-optimal.) 

In order to demonstrate the complexity of this problem, we now consider the case of 
A = 3. In this case, the symmetric search strategy s satisfies v(s) = 3/2 + 1/6 = 5/3, 
but we now present a strategy s, originally suggested by D. J. Newman, which satisfies 

u(j) = (4 + In 2)/3 < 5/3. 

The strategy s is a specific choice among the following family {,?/;• } of search 
strategies: 

Definition 3.31 The family (s/.} of search strategies is constructed as follows: Consider 
a set of trajectories Sjj a , where i and j are two distinct integers in the set { 1,2, 3} and 

0 < a < 1. The trajectory Sjj a starts from O, moves along bj to A, moves along bj to 
the point A a that has a distance of a from A (see Figure 3.9), moves back to A, moves 
to O along b m , where m £ {1,2,3} -{ij}, and then moves from O to A a along bj. 

Let F(,a) be a cumulative probability distribution function of a random variable 
a (0 < a < 1). Then the strategy Sf is a probabilistic choice of a trajectory Si jo/’ where 

1 is determined by an equiprobable choice in the set (1, 2, 3}, j is determined by an 
equiprobable choice in the set {1, 2, 3} - {i),anda is chosen independently, using the 
probability distribution F. 

Note that the symmetric strategy s is a member of the family [sp } with the random 
variable a being identically zero. 

We now show that there exists a search strategy s = s in [sp] with value less than 
v(s). The distribution function F that corresponds to the strategy s is the following: 



F(a) = 



0 

1 <?“ 

2 + T 

1 



a < 0, 

0 < a < In 2, 
a > In 2. 




(3.13) 



Figure 3.9. 
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(Note that F has a probability mass of 3/4 at a = 0.) Let Hp be any hiding point 
with distance p from A . Then it is easy to see that 



c(s, Hp) = -(1 — P) + 



i / r ln 2 

~A' +2 L ' 



adF(a) + p 



in 



(.2oc + 2~P)dF(a) + pO- F(fi)) 



)' 



It is easy to check that the derivative ofc(.f, Hp) with respect to ft is equal to zero 
for 0 < P < In 2 and to -1/3 for In 2 < p < 1 . Thus, it is sufficient to calculate 
c(s , H) for p = £, where e is small, and the calculation readily shows that 



v(.?) = 



4 + ln2 
3 



< 



5 

3 



= v(^), 



where s is the symmetric strategy. 

We now show that .v is the best search strategy in the family {.v /.- } by presenting a 
hiding strategy h that satisfies for all Sp e {sp}. 



c(s, h) > 



4 + In 2 
3 



(— 1.56). 



Definition 3.32 The hiding strategy h is presented as follows. Make an equiprobable 
choice of an arc bj, i e (1,2, 3}, and hide at the point ofb-, that has a distance P from 
A , where P is a random variable that has the probability density 2e ^ for 0 < P < In 2 
and zero otherwise. 

It is easy to see that for all Sija used by any search strategy s p : 
d ~ / 1 1 /*ln2 \ T 

— c(Sij a , /t) = 2 f - + - I 2 e'^dp I - j2e~ a = 0 forO < a < In 2, 



and 



Thus, 



2 

3 



for In 2 < a < 1 . 



c(sp, h) > c(Sijo,h ) 



1 

3 





4 + In 2 
3 



It follows that if the searcher can use only search trajectories S ij a , then s and 
h presented previously are optimal strategies and the value of the restricted game is 
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(4 + ln2)/3. It is very plausible that s and h are optimal even if the searcher is not 
restricted to S jj a trajectories. In order to show that, one would have to show that 

c(S, h) > ^ ~^ ln ^ for all S. 

(We have just proved this inequality for S — Sjjc). 

Numerically, verifying that the value is indeed (4 + ln2)/3 (within any desired 
accuracy level) can be done by dynamic programming, as will be described in the next 
section. However, formally proving that fact has required a lot of effort: Bostock (1984) 
solved a discrete version of the three-arcs game, allowing the searcher to choose his 
trajectory more extensively than Sua (but still limited to a subfamily of trajectories). 
By letting the number of grid points tend to infinity he showed that s and h are optimal 
under a weaker assumption on the search strategies. Finally, L. Pavlovic (1993a, 1993b, 
and 1995b) has succeeded in proving that the value is indeed (4 +ln2)/3 and that s is 
the optimal search trajectory, under no restrictions on the search trajectories. Pavlovic 
also presented the optimal solution of the search game for any odd number k of arcs. The 
optimal search strategy is randomly (equiprobably) choose the traversed arc, among the 
untraversed arcs, until only three untraversed arcs remain; then use s for these three arcs. 

Remark 333 Unlike the weakly Eulerian networks, the value of the three-arcs network 
depends on the starting point of the searcher. For example, assume that the searcher 
starts at the middle of arc b\. Then, by using the uniform hiding strategy, ll llt the hider 
can guarantee an expected capture time > 19/12 > (4 + ln2)/3. (It can be verified 
that the searcher’s best response is to go to one of the nodes, search and 63, and 
finally retrace b\ in order to search the unvisited half of b\.) 

Remark 334 Solving the search game in a network with an arbitrary starting point 
for the searcher, is an interesting problem, which has not been investigatedyet. Let v be 
the value of this game. It is easy to see that v < p/2, where p is the minimal length of a 
path (not necessarily closed ) that visits all the points of Q. Indeed, sometimes V ~ p/2 
as happensfor networks with p = p because the hider can guarantee expected capture 
time > p/2 by using the uniform strategy h^. (Such an example is the three-arcs game, 
which is much more tractable with an arbitrary starting point than under the usual 
assumption of a fixed starting point known to the hider.) However, it is not clear for 
which family of networks the equality v = p/2 holds. For example, does it hold for 
trees ? 

Remark 335 The problem of finding the optimal search strategy for a (general) network 
has been shown by von Stengel and Werchner (1997) to be NP-hard. However, they also 
showed that if the time of search is limited by a (fixed) bound, which is logarithmic in the 
number of nodes, then the optimal strategy can be found in polynomial time. The search 
game on a network can, in general, be formulated as an infinite-dimensional linear 
program. This formulation and an algorithm for obtaining its ( approximate ) solution 
is presented by Anderson and Armendia (1990). 



This observation was made by Steve Alpern. 
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The difficult problem of finding an optimal (minimax) search strategy for a (non 
weakly Eulerian) network could be somewhat simplified if the following property holds. 

Conjecture 3.36 Optimal strategies for searching for an immobile hider on any network 
never use trajectories that visit some arcs (or parts of arcs) more than twice. 

Note that this property does not hold for best responses to all hiding strategies. For 
example, in the three-arcs network, if the hiding probability is j — e for each point 
Bj , i = 1, 2, 3, having distance j from A , and e for C, , i = 1, 2, 3, having distance 
j from A , then for a small e the optimal search strategy is to go to A , then to one of 
the un visited Bj , then through A to the unvisited Bj , continue to C j and finally from 
Cj to Cj through Bj . Thus, the segment A B-, is traversed three times. Moreover, in the 
linear search problem, described in Section 8.1, there exist hiding distributions against 
which the best response search trajectories visit the unit interval an infinite number 
of times. Still, we conjecture that such a situation cannot occur against an optimal 
hiding strategy. This property holds for the weakly Eulerian networks and also for the 
three-arcs network. 



3.6 Using Dynamic Programming for Finding Optimal 
Response Search Trajectories 

Dynamic programming (DP) is a useful numerical optimization scheme developed 
by Bellman and others around 1960. It is based on the following recursive principle, 
which we apply to search problems. Assume that we wish to minimize the expected 
search time for an object that is hidden in one of n possible points with known hiding 
probabilities. Any search trajectory is determined by a series of actions a i, 02, . . . , a n 
such that each corresponds to going from the current location to a location that has not 
been searched before. Rather than trying to determine all the optimal actions together, 
DP determines one action in each step minimizing the sum of the expected immediate 
cost plus the expected future cost. The optimality equation has the following form: 

F(Z)= min \C(Z,a) + 

aeA(Z) L 



p(Z,a) 
P(Z) J 



x F(0(Z 



■ a)) 



(3.14) 



where 

• Z is the state of the search (i.e., the current “unsearched" part of the space and 
the location of the searcher), 

• F(Z) is the current expected remaining search cost under an optimal trajectory, 

• A (Z) = [a] is the current set of actions a available (i.e., what locations could be 
searched next), 

• P(Z) is the probability that, at state Z, the object has not yet been found, 

• C(Z, a) is the cost of action a at state Z (e.g. the time needed to go to a specific 
next location). 
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• p(Z , a) is the probability of that the object is in this location (piZ. , a ) /P( Z ) is 
the probability that the object is found at the present stage under the condition that 
it has not been found before), 

• 0 (Z, a) is the “new” state after using action a, (i.e., the updated unsearched part 
and searcher’s location), and 

• F( 0 (Z, a)) is the new (minimal) expected remaining search time, after using 
action a under the condition that the object has not been found at the present 
stage. 

Note that at each step the number of unsearched locations of the state space is 
reduced by 1 . Thus, the new search problem is reduced in some sense. 

If the number of possible hiding locations is n then we will have to use the recursion 
(3.14) forn steps and compute F(Z) for all the intermediate possible values of Z finally 
obtaining the value of the optimal solution, F(Zq), where Zo is the initial state, i.e., the 
unsearched part being the whole space and the searcher at the origin O. The optimal 
action for each state is obtained from the argmin of the right side of (3.14). 

Such a technique would obviously be inefficient if the number of possible states 
is an exponential function of n (e.g., search in a complete network with n nodes). 
However, there are many interesting examples where the structure of the problem keeps 
the number of possible states manageable (e.g., polynomial in n with a small degree). 
For example, assume that we search the real line, starting from the origin, looking for 
an object with n possible locations (on both sides of the origin). Then the number of 
possible states is bounded by n 2 because at each stage the current undiscovered part of 
the search space is determined by the two extreme right and left locations visited by the 
searcher. We will present the DP algorithm for the search on the line in Section 8.7. 

It is usually more convenient to use the recursive formula for the contribution f (Z) 
of the current undiscovered part of the search space to the remaining expected search 
tim e,/(Z) = P (Z) x F (Z). In other words, /is the sum of the original probabilities of 
the unsearched locations multiplied by the remaining time to reach them by the optimal 
search plan. Multiplying both sides of (3.14) by P(Z), the corresponding recursive 
equation for / is 

/(Z)= min [P{Z)xC(Z,a) + f(<KZ,a))}, (3.15) 

aeA(Z) 

where, as in (3.14), we compute /(Z) for all the intermediate possible values of Z. In 
the final step we obtain the value of the optimal solution, / (Zo), where Zo is the initial 
state, i.e., the unsearched part is the whole space and the searcher is at the origin O. 
The optimal action for each state is obtained from the argmin of the right side of (3.15). 
Here, there is no need to compute new conditional probabilities at each stage; the same 
location probabilities can be used all the time. 

We now illustrate the DP approach for the three-arcs problem of the previous 
section. We would like to verify that the minimal expected search time against the 
hiding strategy h (given in Section 3.5) v(h) is (4 + ln2)/3. The numerical value of 
v(h) can be approximated for any desired accuracy level using the following scheme. 
For each arc divide the segment starting from A with length In 2 into m intervals of 
equal length e — In 2 /m. For each interval, j, replace the (continuous) probability 
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density of h over this interval by one probability mass pj concentrated at the midpoint 
of the interval at distance (j — from A. (Choosing the center of gravity rather 
than the midpoint would lead to more accurate result, with the same complexity. For 
convenience, we present here the simpler version.) In the new (approximated) hiding 
strategy the object can be at any one of n = 3 m locations. For each stage, the current 
state space depends on the number of arcs with unsearched locations. 

In the first time period, before A has been visited, there are three such arcs so that 
the state of the search can be described by the 4-tuple (/, k 1 , k 2 , k 3 ), where for each 
j, 1 < j < 3; kj , 1 < kj < m is the number of unsearched locations (near A ) on 
arc /'; and i, 1 < / < 3 is the arc number where the searcher is now located (at distance 
£ x (I/ + from A ). The next location to be searched is either the next point on arc i 
or the left point of one of the other arcs. Thus, for i = 1, the recursive formula is 



f(\,k\,k 2 , k 3 ) = min 



P(k\, k 2 , k 3 ) x £ + /(l,l| - 1, to, *3). 

P(k \ , * 2 , k 3 ) x [2 - (*, + fe 2 )£] + /( 2, k x , k 2 - 1 , * 3 ), 
P(k\,k 2 , k 3 ) x [2 - (£| +A: 3 )e] + /( 3, k\, k 2 , *3 — 1) 



where P(k\,k 2 , k 3 ) is the sum of the probabilities over the unsearched locations. 

A similar recursion holds for i = 2 and i = 3 (see Figure 3.10). 

After one of the kj reduces to 0 (say k 3 ) then unsearched locations of the two 
remaining arcs would be in two segments 1 < l\ < k\ and 1 < l 2 < k 2 with the 
searcher either on arc 1 at distance (/| — |)£fromH (in this easel < lj) or on arc 2 at 
distance (l 2 — | )s from A (with 1 < l 2 ). The corresponding two arcs recursive formula 
used for i = 1 (similar recursion holds for i = 2) is 



<p(\,ki,l\-,k 2 , l 2 ) = min 



P(k\, l\ \ k 2 , l 2 ) x e + <p(l,ki,li + l;* 2 ,/ 2 ), 

P(k\J\\ k 2 , l 2 ) x (Zi + l 2 — 2 )e + <p( 2, k\,l\\ k 2 , l 2 + 1) 



where tp takes the place of / in (3.15) and P(k\, l\] k 2 ,l 2 ) is the sum of probabilities 
over the unsearched locations (on the two remaining arcs). 




Figure 3.10. 
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Note that the recursive formulas for the transition between three and two arcs has 
the following form (e.g., when k\ and k 2 > 1 and kj = 1): 



/(!> k\,ki, 1) = min 



P(ki , * 2 , 1) x [2 - (*, + k 2 )e] + /( 2, Jfci, *2 - 1, 1); 
P(ki,k 2 . l)x[2-(*i + l)e] 

P(ki, 1; * 2 . 1) x e + (p(l,ki,2;k 2 , 1), 

+ min 

P(£i, 1; * 2 , 1) x s + <p(2,k\, 1; k 2 , 2). 



When/; becomes &,• (i.e., arc i, say i = 2, has been completely searched) and only 
one arc remains, the following boundary condition is used: 



<p(i, k[,li',k 2 , k 2 ) = min 



*i *i 

(Ji 2 - 1) x e E Pi + JPJ x £: 

j=>\ j=h 

(2 - (k 2 - 1) x e) Pj - XI M x £> 

y'=h ;=/i 



Note that in each of the above formulae the number of unsearched locations corre- 
sponding to the right side states is smaller than the number of unsearched locations in 
the right side. This enables us to recursively calculate the <p values (0(iz 4 )) and then 
the / values (0(n 3 )). The minimal search time is obtained by 

f(Zo) = 1 — (m — j) x e + /(l, m — 1, m, m). 

(Due to symmetry we can assume that the search starts along arc 1.) The optimal 
search trajectory is obtained step by step by using the alternative that produced the 
minimal value at any given state in order to move into the next state. 

For the location probability based on h, given Definition 3.32 we should obtain an 
expected capture time approximately (4 +ln2)/3. This would demonstrate, numeri- 
cally, that the hider can indeed achieve (4 + ln2)/3. Since we have shown that the 
searcher can guarantee capture time not exceeding (4 + In 2)/3, it would follow that it 
is indeed the value of the three-arcs game and that s and h are optimal strategies. 4 

The DP algorithm will be used later in Sections 8.7 and 16.8 (search on the line 
and rendezvous search on the line). 



3.7 Search in a Multidimensional Region 

When considering search games with a mobile or an immobile hider in multidimensional 
compact regions, we would like to avoid unnecessary complications, and thus we shall 
make a rather weak assumption about the region Q. We shall use the following definition. 



4 Such a computation has been carried out by Victoria Ptashnikov, who also removed several bugs from 
the formulae. 
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Definition 337 A compact region Q is called simple if the boundary of Q can be 
represented as the union of two continuous single-valued functions in some coordinate 
system ( see Figure 3.11). 

We shall usually make the following assumption. 

Assumption 1. The compact region Q is the union of a finite number of simple 
regions with disjoint interiors. 

Note that we allow multiply connected regions with a finite number of “holes” 
in them. 

When considering search games in multidimensional regions, we assume that the 
hider is captured at the first instant in which the distance between him and the searcher 
is less than or equal to r, where the discovery radius r is a small positive constant. (The 
case in which r is not a constant is considered in the next section.) 

In this section, we are concerned with an immobile hider. Since Theorem 3.3 holds 
for any search space Q, it follows that by using the uniform hiding strategy, h^, the 
hider can guarantee that the expected discovery time is greater than or equal to p/2 p, 
(neglecting ()(r 2 ) terms), where p, is the Lebesgue measure of 0 and p is the maximal 
discovery rate. Thus, the value v of the game satisfies v > /i/2 p = /i/4 r. 

An upper bound for v can be derived by extending the notion of a tour (used for 
networks) to multidimensional region: 

Definition 3.38 A closed curve L that passes inside Q is called a tour if for any Z £ Q 
there exists Z' e L such that d{Z, Z') < r. 

It has been shown by Isaacs (1965, sec. 12.3) that if L is a tour with length r, then 
the searcher can guarantee an expected capture time not exceeding r/2 by choosing 
each one of the directions, of encircling L with probability 1/2. (We used such a search 
strategy for Eulerian networks.) Thus, if we could find a tour whose length T satisfies 

r<(l + e)p/p (3.16) 

then we would have V ~ p/2p = p/4 r, and in this case the strategies of the searcher 
and the hider already described would be £-optimal. We now show that for any two- 
dimensional region which satisfies a rather weak condition, it is possible to find a tour 
whose length satisfies inequality (3.16). 

Lemma 3.39 Let Q be a two-dimensional compact region that satisfies Assumption 1 
with the boundary of each one of the simple regions composing Q having a finite length. 
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Then for any e > 0 there is an r e such that for anyr < r e there exists a tour with length 
less than (1 + e)p,/2r. 

Proof. Let Q = Qi, where Q,, 1 < i < m, is a simple region with area 
fj,j . If for each 1 < i < m we could find a line that covers Q t with length less than 
(1 4- e/2)/Lt,/2r, then we could “link’ - these curves together and form a tour of Q by 
adding m arcs. The length of such a tour for Q would not exceed 



(>+§) 



+ mD 



£ 

! + - + 



2 rmD\ 
M / 



ff 

2 r' 



where D in the diameter of Q. Obviously, if r is small enough, then 2 rmD/p < e/2, 
so that we would have the required tour for Q. Thus, we may assume that Q is already 
a simple region. Let f\ and fa be the graphs of the continuous functions that bound Q. 
We can now cover Q by parallel strips of width 2 r each, as depicted in Figure 3.12. 

Let yj , yi, . . . be the lengths of the vertical line segments formed by these strips. 
Then by moving alternately along these line segments and along the boundary of Q, 
we form a covering line (not necessarily closed) for Q with length less than ^ yj +Y< 
where y is the length of the boundary of Q. Due to the fact that f\ and fa are Riemann 
integrable, it follows that for any e > 0, if r is small enough, then 2r y; < ( 1 +£/2)/r. 
Thus, the length of the covering line is less than 



(\ + - + — ) — 
V 2 fi ) 2 r 



< 0 +e)~ 

2 r 



for a small enough r. 



In proving Lemma 3.39 we used the fact that two-dimensional regions can be covered 
by narrow strips with little overlap. The analog construction for three dimensions would 
require covering the region with narrow cylinders, but in this case the overlap would 
not be negligible. Thus, by using the uniform strategy, the hider can keep the expected 
capture time above where fi > 1/2. It should be noted, though, that in the 

next chapter we will show that the value of a search game with a mobile hider in a 
multidimensional region satisfies 



v < (1 + e)fi/p, where e — > 0 as /• — »■ 0, 



and this bound is obviously applicable for the search game with an immobile hider. 

Remark 3.40 The result of this section extends to the case of J searchers. Thus, if pis 
the total rate of discovery of the searchers (see Corollary 3.6), then for two-dimensional 
regions v ~ p/2p. 
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3.7.1 Inhomogeneous search spaces 

In this section, we assume that the search space is a multidimensional compact region of 
the same type considered in the previous section. However, here we allow the maximal 
velocity of the searcher, which we denote by u(Z) , to depend on the location of the 
searcher. We assume that v{Z) is a continuous function which satisfies 

0 < i>| < i>(Z) < i>2, (3-17) 

where tp and V 2 are constants. We also allow the detection radius to depend on the 
location of the hider. In this case, the detection radius is a function r(Z) whereby if 
the hider is located at a point Z, then he can be seen from any point Z\ that satisfies 
d(Z \ , Z) < r(Z). We assume that 



r(Z) = rij(Z), 

where r is a small number and rj(Z) is a continuous function that satisfies 
min zeQ V(Z) > 0. 

The discovery rate of the searcher, p(Z), is defined as 2 v(Z)r(Z) for two- 
dimensional regions, 7 rv(Z)r 2 (Z) for three-dimensional regions, and soon. The capture 
time c(S, H) is given by 

c(S, H) = min{f : d(S(t), //(f)) < r(H(t )) }. 



Such a search space will be referred to as an inhomogeneous search space. We will 
show that results established in the first part of the section can be extended in a natural 
way. We first extend Theorem 3.3 to the inhomogeneous case. In this case, the natural 
randomization of the hider is given by choosing his location using a probability density 
which is proportional to 1 /p(Z). 



Lemma 3.41 

where 



Let h be the hiding strategy that uses the probability density \/xp{Z), 



f dp( Z) 

Q P(Z) 



(3.18) 



and p is the Lebesgue measure. Then h satisfies 



v(h)> 



1 r 

1 +e2' 



(3.19) 



Proof. A simple continuity argument (for details see Gal, 1980) implies that for any 
search trajectory S, the Lebesgue measure of the strip that is swept during a small time 
interval At is less than (1 + s/2)p(S(t))At. If the hider uses h, then the probability 
mass of such a strip is less than (1 + e)At/r, where r is given by (3.18). Thus, 
Pr(T < t) < (1 + e)t/x, so that 



c(S, h) 




Pr(T > t)dt > 




1 +£ \ 



dt = 



1 r 
1 +e 2' 
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We now extend the result established in the first part of the section and show that the 
value of the search game with an immobile hider in a two-dimensional compact region 
satisfies^ ~ r/2, wherer is given by (3.18) with 

p{Z) = 2v(Z)r(Z). 



The inequality v > (1/1 + e)r/2 is an immediate consequence of Lemma 3.41. 

An upper bound for the value can be obtained by constructing a tour (see 
Definition 3.38), such that the time required to encircle it is less than 



(1 +*) 



L 



dn(Z) 

2 v{Z)r{Z)' 



This can be done by dividing Q into several regions Q\, ... , Q„ such that the variation of 
r)(Z) in Qii = 1 n, is small and then using the technique adopted in Lemma 3.39. 

Remark 3.42 All the results presentedfor the networks in the previous sections can 
easily be extended to the case in which the maximal velocity of the searcher, v( Z), 
depends on his location in the network Q. {The radius of detection is assumed to be 
zero, as before.) In this case, the Lebesgue measure of Q, p, should be replaced by 



( d ^ Z) 
Q V(Z) - 




Chapter 4 

Search for a Mobile Hider 



4.1 Introduction 

The interest in search games with a mobile hider was motivated by the presentation of 
the Princess and Monster game described by Isaacs (1965, sec. 12.4). In this game, the 
Monster searches for the Princess in a totally darkroom Q (both of them being cognizant 
of its boundary). Capture occurs when the distance between the Monster and the Princess 
is less than or equal to r, where r is small in comparison with the dimension of Q. As 
a stepping stone for the general problem, Isaacs suggested a simpler problem in which 
Q is the boundary of a circle. The Princess and Monster game on the boundary of a circle 
was solved several years later by Alpern (1974), Foreman (1974), and Zelikin (1972), 
under the additional assumption that the maximal velocity in of the hider, satisfies u> > 1 
(recall that the maximal velocity of the searcher is usually taken as 1). Their formulation 
is a little different from the one that we usually adopt, in the sense that the searcher 
does not start from a fixed point O. Instead, they assume that at t = 0 the initial relative 
position of the searcher with respect to the hider has a known probability distribution. 
(Alpern, 1974; Zelikin, 1972, assume a uniform distribution, while Foreman, 1974, 
considers a general probability distribution.) A discretized version of this problem was 
solved by Wilson (1972). An attempt to develop a technique for obtaining approximate 
solutions of some discrete games with a mobile hider by restricting the memory of the 
players was presented by Worsham (1974). Some versions of this game with a fixed 
termination time were considered by Foreman (1977). A general solution of the Princess 
and Monster game in a convex multidimensional region, was presented by Gal (1979, 
1980). The solution will be presented in detail in Section 4.5. An important intermediate 
step is solving the search game with a mobile hider on a network consisting of k arcs 
connecting two points, which will be presented in Section 4.2. Some extensions were 
made by Garnaev (1991) (the velocity vectogram of the searcher is a rhombus-type 
set) and (1992) (the probability of detection depending on the distance between the 
players). A new type of search strategy presented by Lalley and Robbins (1988), will 
be described in Section 4.6. 1 . 
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Anderson and Armendia (1992) formulated the search game in a network as an 
infinite-dimensional linear program. They obtained an optimality condition for the 
strategies and an improvement technique (if the currently used strategies are not 
optimal). 

We now present the general framework of search games with a mobile hider, to 
be considered in this chapter. We assume that the search space Q is either a network 
or a compact multidimensional region. A pure strategy of the searcher is a continuous 
trajectory S(t), t > 0 inside Q which satisfies 

5(0) = O and forany 0 < l\ < : d(S(t{), S(?2)) < tj — 0- 

A pure strategy of the hider is a trajectory H(t),t > 0, inside Q, with H( 0) an 
arbitrary point chosen by the hider, which satisfies 

for any 0 < t\ < ti : d(H(t\), Wfo)) < u>(t i — t\), 

where w is the maximal velocity of the hider. 

The capture time T is the first instant t with d(S(t), H(t )) < r, or infinity if no 
such t exists. As usual, we take r = 0 for the search in a network and assume that r is 
a small positive number in the case that Q is a multidimensional region. 

When considering the role of mixed search strategies, one immediately observes 
that their advantage over pure search strategies is much greater in the case of a mobile 
hider than in the case of an immobile hider. In order to see this fact, consider the case of 
an immobile hider in an Eulerian network of length p. In this case, by using a trajectory 
(pure strategy) that traces an Eulerian curve, the searcher can guarantee an expected 
capture time not exceeding p, whereas the use of mixed strategies (tracing the curve 
equiprobably in each direction) enables the searcher to guarantee an expected capture 
time not greater than pH. Thus, the use of mixed strategies against an immobile hider 
yields an improvement by a factor of 2. 

On the other hand, if a mobile hider can move on a circle with maximal velocity 
W > 1, then no pure search strategy can guarantee capture. This is in fact true for any 
network other than the line segment. However, Alpern and Asic (1985) showed that if 
mixed strategies are allowed, then the search value is finite (see Section 4.4). 

In the case of the circle (see Section 4.3), the use of a mixed search strategy enables 
the searcher to achieve expected capture time not exceeding p. Thus, the advantage of 
using mixed strategies is much greater in the case of a mobile hider. 

Remark 4.1 The problem of finding the minimal speed advantage of the searcher, which 
guarantees capture using pure strategies only was investigated for a given network by 
Fomin (1999) and for rectangular domains by Avetisyan and Melikyan (1999a, 1999b). 
Note that having a search trajectory that guarantees capture actually guarantees it even 
if the ( mobile ) hider can see the searcher (but, as usual, the searcher cannot see the 
hider unless their distance is within the capture radius). 

Remark 4.2 If only pure search strategies are used then, except for trivial cases, several 
searchers are needed to guarantee capture. The interesting problem of determining the 
minimal number of such searchers in a given network, called the search number, was 
considered by Parsons (1978a, 1978b) and Megiddo and Hakimi (1978). This problem 
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has attracted much research. An important result for this game was obtained by Megiddo 
et al. (1988), who showed that the problem of computing the search number is NP-hard 
for general graphs but can be solved in linear time for trees. 

As in the discussion in Section 3.1, it is worthwhile to consider the unrestricted game 
(which is actually a discrete version of the search game) in which the physical constraint 
of the continuity of the trajectories of the players is disregarded. In this game, the set 
Q is divided into n “cells” Q ] , . . . , Q n , each with measure p/n. The searcher and the 
hider can move from one cell to another at the time instants f; = i At , i =0, 1 , n, 
where A t = p/np. The time interval At may be regarded as the time required by the 
searcher to sweep one cell. We assume that capture occurs at the end of the first time 
interval in which both the searcher and the hider occupy the same cell. It is easy to 
see that the value of the unrestricted game is p/ p, because both the searcher and the 
hider can guarantee this value by using completely random strategies, choosing each 
cell with equal probability, 1 In, independently of previous choices, at each time instant 
j =0, 1, . . . , n. 

It is worthwhile to estimate the probability of capture after time t in the unrestricted 
game, both in the case of a mobile and an immobile hider (assuming optimal play on 
both sides). If the hider is immobile then, under optimal strategies, the probability of 
capture after time t satisfies 

Pr(T > /) — 1 — pt/p, 0 <t< pip. 

On the other hand, if the hider is mobile, then 

Pr(T > t)~e- p,/tl , 0 < / < oc. 

In contrast to the unrestricted game, we shall be mainly concerned with games 
in which both the searcher and the hider are restricted to move along continuous tra- 
jectories, but it is interesting to note that in spite of that restriction, the value is still 
(1 4- e)p/p for most games to be considered in this chapter. The first search space we 
discuss is a network consisting of k arcs connecting two points (Section 4.2). Solving 
the search game for the k arcs can give us a useful insight, which helps us to understand 
how to solve the Princess and Monster game in multidimensional regions (Sections 4.5 
and 4.6). 

A remarkable property of most of the search games solved in this chapter is the 
following. There exists a function P(t), which decreases exponentially in t, such that 
for all t both the searcher and the hider can keep the probability of capture after time t 
around P(t). In practice, this property may sometimes be more useful than the expected 
capture time, since the cost function need not be T itself but may place a different (e.g., 
heavier) penalty on larger values of T. Indeed, this property can be used to show that the 
optimal (e-optimal) strategies obtained for the games in which the capture time serves as 
a cost function are still optimal (e-optimal) even if the capture time is replaced by a more 
general cost function (Section 4.6.8). In other words, the optimal (^-optimal) strategies 
of these games are uniformly optimal for all reasonable cost functions. (Strategies 
which guarantee P(t) are called uniformly optimal in Book II.) An exception to the 
existence of uniformly optimal strategies was investigated by Alpern and Asic (1986) 
(see Remark 4.16). 
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Remark 4.3 The Princess and Monster game was also used in some biological models. 
See Meyhofer el al (1997) and Djemai et al. (2000). 



4.2 Search on k Arcs 

Solving the k arcs search is important because it gives an insight into the nature of 
optimal search strategies for a mobile hider in more general domains and in particular 
for multidimensional regions. Here we consider the following search game. The search 
space Q is a set of k non-intersecting unit length airs b\, ...,bk, joining two points 0 
and A, as depicted in Figure 4.1. 




Figure 4.1. 



(The solution of this problem in the case that the arcs have a common length different 
from 1, is easily obtained by using the scaling lemma of Chapter 2.) The searcher has 
to start moving from O with maximal velocity equal to 1. The hider can choose an 
arbitrary starting point, and from this point he can move along any continuous trajectory 
in Q with maximal velocity w . In this chapter, we shall usually assume that w > 1 . 
The capture time 7j which is the loss of the searcher (or the gain of the hider), is 
the time elapsed until the searcher reaches a point that is occupied at the same time 
by the hider. In order to avoid unnecessary complications, we make the following 
assumption. 

Non-Loitering Assumption. The searcher can pass but not stay at the points O or A. 

This assumption is not needed for k - 2 (search on a circle) as we shall see in 
Section 4.3. 

We shall show that the optimal strategies of the searcher and of the hider are both 
of the type U (Z, r) to be defined. 

Definition 4.4 Let Z be either the point O or the point A and let 0 < r < OO be any 
nonnegative number. Then the (random) trajectory U(Z, r) is defined by the following 
rules. At time r, starting from the point Z, choose an integer i equiprobably from the 
integers ( 1 , 2 ,... ,k) and move along the arc bj with unit velocity to the endpoint of 
this arc ( O or A). Then (at time r + 1) make another equiprobable choice of an integer 
j € (1,2 , ... ,k), independently of i, and move along bj until the other end point 
(A or O) is reached, and so on. 

Using Definition 4.4, we state the following theorem. 
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Theorem 4.5 For the search game on k arcs, an optimal strategy for the searcher is 
U (O, 0). An S -optimal strategy for the hider is to start at A, stay there until time 1 — s, 
and then use the strategy U {A , 1 — e). The value of the game is k(= p./ p). 

The theorem will be proved using three lemmas. The following sweeping lemma is 
the fundamental one and will also be used in the next sections. 

Lemma 4.6 (Sweeping Lemma) Let Q denote the network consisting of two nodes 0 
and A, connected by k unit length arcs. Define k “sweepers, ” which move from O to A 
at unit velocity during the time interval 0 < t < 1, each along a different arc. Then 

(a) Any (continuous) hider trajectory H(t) will meet at least one sweeper by time 

t = 1. 

(b) If the hider’ s trajectory H(t) has velocity not exceeding one, then it will not 
meet more than one sweeper by time t = 1. 

Proof. 

(a) Let /(f) = d(H(t), O) denote the distance (in the network Q ) of the hider from 
the searcher starting node 0 at time t. Since H is continuous, so is the function H. 
The function t -f(t) is continuous on the interval [0, 1] and is negative for t = 0 
and non-negative for f = 1. So the Intermediate Value Theorem ensures that for 
some fo with fo < 1 we have to — f (to) = 0. At time to the hider is therefore at 
distance to from 0 along one of the k arcs - but so is one of the sweepers. 

(b) Let fo be the first time when the hider meets one of the sweepers - say, i. In 
order to meet another sweeper before time 1, he must move to a different arc. If 
he does this via the node O, he cannot catch the sweeper on that arc before time 
1, as he has the same speed. So he would have to move to another arc via A. 
However, he cannot reach A before sweeper i, hence not before time 1 . 



Lemma 4.7 If the searcher uses the strategy U(0, 0) (see Definition 4.4), then for any 
hiding strategy H 

c(U(0, 0), H) < k. 

Proof. Strategy U (O, 0) for the searcher means: At time f = 0, the searcher makes 
an equiprobable choice from the sweepers of the sweeping lemma and moves with the 
chosen sweeper to A ; then at time t = 1 he makes an independent equiprobable choice 
from k such sweepers that run from A to O, etc. We may look upon the hider as the 
fugitive, and thus the sweeping lemma implies that if the searcher uses U(0, 0) and 
if the hider has not been captured by time j - 1 , then the probability of capture in the 
time period j — 1 < t < j is at least 1 A (independently of the previous part of the 
trajectory). Thus, 

(k - 1 v 
Pr(T>j)<(—) ■ 



(4.1) 




50 



BOOK I. SEARCH GAMES 



It follows that for any hiding trajectory H we have 



c(t/(0, 0), H) < Pr(J - 1 < T < j) x j 
j= i 

7=1 7=1 ' ' 



k. 



Lemma 4.8 Let Ka be the strategy of the hider described as follows. Stay at point A 
until time t = 1 — S and then use U(A, 1 — e) (see Definition 4.4). Then for any search 
trajectory S, 

c(S, Ra) >k — e. 

Proof. Using strategy U(A, 1 — e) for the hider means that he chooses each of 
the sweepers (note that it is the hider who now moves with one of the sweepers) 
with probability l/k (and independently of previous choices) at the time instants 
1 — e, 2 — e, 

Now the searcher takes the role of the fugitive of the sweeping lemma. We shall use 
the fact that the velocity of the searcher does not exceed the velocity of the hider and 
the non-loitering assumption, which states that the searcher cannot wait for the hider at 
either of the points 0 or A. Thus it can be assumed that at any one of the time instants 
1 — e, 2 — e, .... the probability that the searcher is either at O or at A is zero (this can 
be achieved by the hider by using fas a random variable uniformly distributed in any 
small interval). Thus the condition of part ( b ) of the sweeping lemma is satisfied so that, 
for any trajectory of the searcher, if the hider has not been captured by time t = j — s, 
then the probability of capture in the time period j — e < t < j + 1 — e is equal to l/k. 
Thus, the probability that capture will occur in the time period j — e < t < j + l — e 
is equal to (A: — 1 /ky~ x x 1 /k. Hence 






1 

x t O' - e) = k - e. 
k 



Theorem 4.5 is an immediate consequence of Lemmas 4.7 and 4.8. Actually, if 
k > 1, then c{U(0, 0), H ) < k for any hiding trajectory II. The above statement 
holds because if H( 1) = A, then c(U(0, 0), H) = 1, while if H( 1) ^ A, then the 
capture time is equal to a, a < 1, with probability 1/ k. Thus 

c(U(O,0),H) <?- + l ^l(l+k) <k. 
k k 

Therefore, any mixed Hiding strategy h satisfies c(U(0. 0), h) < k. Since v = k, it 
follows that the hider does not have an optimal strategy (but obviously has i-optimal 
strategies) while the searcher does have an optimal strategy. This result is in accordance 
with the results presented in Appendix A. 
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Remark 4.9 It follows from the proof of Lemmas 4.7 and 4.8 that the searcher can 
keep the probability of capture after t below ( k — ([?J is the integer part oft), 

while the hider can keep the probability of capture after t above ( k — 1 / Jfe) Thus, 
if instead of choosing the capture time T as the cost function, we consider a more 
general costfunction W(T), where W is any monotonic nondecreasing function, then 
the searcher can keep the value below 



nw) = f; »-(i) x \ (i^V ' 

(=1 v 7 

while (for any e > 0) the hider can keep the value above 

“ 1 (k — 1 V 

f A W)^W(i-e)x- k \- 1 -j 



(4.2) 



IfW is continuous oil the left at the points T = 1,2,3,..., then f e (W) — > f(W) as 
£ — *■ 0. In other words, if the cost is a continuous nondecreasing function of the capture 
time, then by using \J(O,0), the searcher can guarantee the value f (W), while the 
hider can guarantee (1 — £)f(W). It follows that in this case, the value ofthe game is 
f ( W) and the optimal(E-optimal) strategies ofthe searcher and the hider are still the 
same as those described in Lemmas 4.7 and 4.8. 

It is interesting to note that the value of the game is equal to k irrespective of the 
maximal speed ofthe hider it), as long as it is not less than unity. If it) is less than unity, 
then the optimal strategies of the searcher and the hider may be quite complicated. 
However, using an argument similar to the one previously presented in this section, it 
seems to us that if k is large and it) is not too small, then the value of the game should be 
approximately k, even for the case 0 < it) < 1, because the hider can achieve a value of 
( 1 — e)k by randomly choosing one of the arcs and moving along it with speed U) from 
A to O, then using an independent equiprobabilistic choice of another arc and moving 
along the chosen arc with maximal speed from O to A, and so on. If m = f 1 /w] and 
the searcher moves with maximal speed (unity), then, when reaching each of the points 
O or A, the maximal amount of information, which may be available to the searcher 
is that at that moment the hider is not located on any of the m last arcs visited by the 
searcher. Thus, even if the searcher could rule out m arcs out of k each time he reaches 
O or A, then his gain would be to increase the probability of capture for each period 
j < t < j + 1 from \/k to 1 /( k - m). Thus the expected capture time would decrease 
at most to k-\\/tW], so that if k is large in comparison with 1 Iw, the value obtained 
in Theorem 4.5 would remain about the same even for w < 1 . 

Note that for an immobile hider (see Sections 3.2 and 3.5) the value v of the search 
game on k parallel arcs is equal to k!2 for an even k and is approximately equal to k/2 
for an odd k. Thus, for an immobile hider, d ~ p/2 p. For a mobile hider, v is doubled 
and becomes p,/p, and this is due to the fact that, contrary to the case of an immobile 
hider, the searcher cannot rule out the arcs previously visited by him. We shall have the 
same phenomenon for the case of a two-dimensional search space. 
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4.3 Search on a Circle 

In this section, we consider searching for a mobile hider on a circle. As in the previous 
section, we shall make the assumption that the maximal velocity w of the hider is greater 
than or equal to 1. By denoting the (known) starting point of the searcher by O and its 
antipode by A, the problem reduces to a special case of the search game solved in the 
previous section with k = 2 so that the value of the game is the length of the circle, p. 
However, we must be aware of the fact that, in the previous section, we restricted 
the strategies of the searcher by the non-loitering assumption which requires that the 
searcher cannot wait at 0 or at A. We now show that the result for k = 2 remains valid 
even without this assumption. The searcher’s strategy U(0, 0) (see Definition 4.4), 
which guarantees an expected capture time p(= 2), remains the same, but the hider’s 
strategy needs a little modification. At time t = 0, the hider should choose a random 
point. A', uniformly distributed around A with distance < £ (a small number). It is easy 
to see that by staying at A' until t = 1 — E and then using U (A', 1 — £), i.e., moving 
from A' to O' randomly, the hider can make sure that the capture time will exceed 
2 — s(= p — t ) because the probability of capture at O' or at A' is zero, so that the 
argument used in Lemma 4.8 is valid here as well. 

It should be noted that the result v = jj, depends on the assumption that the initial 
starting point of the searcher is known (5(0) = 0 ). This result is no longer valid if 
one changes this assumption. For example, if one assumes that 5(0) has a uniform 
distribution on the circle, as was done by Alpern (1974) and Zelikin (1972), then the 
value is ^p. This result is demonstrated by the following theorem, which is taken 
from Alpern (1974). (The optimal strategy below was given the name coin half tour by 
Foreman, 1974.) 

Theorem 4.10 Consider the search game with a mobile hider on the circle of circum- 
ference p, assuming that S(0) and H(0 ) are uniformly and independently distributed on 
the circle. Then 

v = (4.3) 

An optimal strategy for each player is to oscillate at speed 1 between his initial point 
and its antipode, each time making an equiprobable choice between the clockwise and 
counterclockwise directions, independently of previous choices. 

Proof. Assume for convenience that p = 2. For any t > 0, let / = [f J and 
a = t — \t\. Let 

< 44 > 

First we show that under the search strategy s, described by Theorem 4.10, the 
probability of capture after time t satisfies 

Pr(T >t)< L(t ) (4.5) 



for any hiding trajectory H. 




CHAPTER 4. SEARCH FOR A MOBILE HIDER 



53 



We prove (4.5) as follows. Assume without loss of generality that I is even. Denote 
the following events 



E: d(0, //(/)) < a (4.6) 

E: the complement of E. (4.7) 



Then the same considerations used in Lemma 4.7 lead to the inequalities 



Pr{T > t\E) < l/2 ,+1 (4.8) 

and 

Pr(T >t\E)< l/2 / . (4.9) 

Since 5(0) is uniformly distributed on the circle, it can be shown that regardless of the 
hider’s motion H 



Pr(E) = ot. 

Now (4.5) readily follows from (4.8) to (4.10). Thus, 
(•00 



(4.10) 



c(s , H) = f 
Jo 



Pr(T > t)dt 

OO 



r l 

= V / Pr(T > I + a) da < 3/2 (by(4.4) and (4.5)). 

7^o Jq 



The optimality of the hider’ s strategy h , described by Theorem 4.10, is established rather 
similarly, using Lemma 4.8 instead of Lemma 4.7, by proving that for any trajectory 5, 
if the hider uses h, then Pr(T > t) = L(t), and thus c(5, h) = 3/2. ■ 

Note that the optimal search strategy is the same as in the case that 5(0) is a fixed 
known point, while the hiding strategy has to be modified due to the fact that the hider 
does not know 5(0). 

The value would still be |/x in the case that both 5(0) and 7/(0) can be chosen by 
the players because each one of them can guarantee an expected capture time not worse 
than |/xby choosing a uniformly distributed starting point and then using the Coin Half 
Tour described above. 

In contrast to the foregoing discussion, we shall see in Section 4.5 that in the case 
of multidimensional search, the value of the game is not sensitive to any assumptions 
about the location of 5(0). 



4.4 Quickly Searched Networks 

In general, solving search games in a Network is difficult. Still, Alpern and Asic (1985) 
(part a) found the following upper bound, demonstrating that the search value is finite: 

Theorem 4.11 Let Q be a Network with m edges, n nodes, diameter D, and minimal 
tour length ft,. Then for any searcher starting point the value of the search game with 
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mobile hider satisfies 

(a) u < 6m D. (If no arc has length exceeding D, then u < Am D ) and 

(b) v < (n + 2 )(D + p/2). (This bound requires loitering strategies.) 

Proof. Consider the following mixed searcher strategy. Randomly pick one of the 
arcs e. Move so as to arrive equiprobably at one of the endpoints of e at time D. 
Observe that no arc can have length exceeding 2D, since otherwise the distance from 
its midpoint to an end would exceed D. So it will always be possible to traverse the arc 
so as to arrive at the opposite end at time 3D. (If e is a loop, then it should be traversed 
equiprobably in either direction.) If the hider has not been found by time 3D. then repeat 
the process in each time interval [3iD, 3(i + I )D], making choices independently of 
previously choices. Observe that in each time period, the probability that the searcher 
will encounter the hider is at least p = 1 /2m. This is because the hider will start the 
time period on some arc e and hence will meet the agent of the searcher starting at one 
of the ends of e. The expected capture time is therefore not more than 3 D/p = 6mD. 
In the common case that no arc has length exceeding the diameter D. the period 3D 
can be reduced to 2D and the value to 4 mD. 

If Q has many arcs but few nodes (for example, many multiple edges or loops), 
and loitering strategies are allowed, then another strategy may be better. From time 
0 to D move with probability n/(n + 2) to one of the nodes and wait there until 
time D 4- p/2 choosing among them equiprobably. With probability 2 J(n + 2) wait 
until time D and then traverse a minimal tour of Q halfway around, equiprobably in 
either direction. Repeat this process, with independent randomization, in each period 
of length D + /i/2. Whether or not the hider moves, he will meet the searcher in 
a given period with probability \/(n + 2), either when the searcher is waiting at a 
node or when he is searching. Consequently the expected capture time does not exceed 
(n + 2)(D + p/2). M 

For example, if Q is the graph with an even number m of unit length arcs connecting 
n = 2 nodes, then D - 1 and p — p — m. So the two estimates are 4m (part a) and 
4(1 +mJ 2) (part b). In this case estimate (b) is better for m > 2. For the complete graph 
on n nodes there are m = n(n - l)/2 unit length arcs and we have D = 1 and p > m. 
In this case the two estimates are 3 n(n - 1) (part a) and at least (n+2)(\+n(n - l)/4) 
(part b). For n > 9 the estimate in part (a) is better. 

What can we say about a lower bound for the value? In Chapter 3 we proved 
Theorem 3.19, which stated that the value of the search game with an immobile hider 
lies between p!2 and p, where p is the sum of the lengths of the arcs of the network Q. 
The lower bound is achieved for Eulerian networks, while the upper bound is achieved 
for trees. It would be interesting to ask similar questions concerning the search for a 
mobile hider. For example, 

Assuming that O = 5(0) is known to the hider, are there any networks with v < pi 
(Recall that v = p holds for the search on the circle or on k arcs under the non-loitering 
Assumption, ) 

It is not too easy to find such a network. (In fact, it was conjectured in Gal, 1980, 
that u > p always hold.) However, Alpern and Asic (1985) showed that v < p for the 
figure eight network presented as follows. 
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4.4.1 The figure eight network 

The “figure eight” network (see Figure 4.2) consists of two circles, each of unit 
circumference and only one point in common, O. Thus /./ = 2. 

Denote the antipodes of 0 by A for the left circle, and B for the right circle and 
denote the arcs from A to O by a i and ai, and from B to O by b\ and bj . Each of these 
arcs has length 1/2. 

We shall now show that if the searcher starts at the center 0 of the figure eight 
network, then 

15 15 

v < — — — a. 

~ 8 16 m 

Suppose the searcher begins by traversing the four possible arcs out of O equiprobably, 
until arriving at one of the endpoints A or B at time 1/2. By this time he will have 
captured the hider with probability at least 1/4. If capture has not occurred and the hider 
has met only one of the possible paths of the searcher (the worst case from the searcher’s 
point of view), then the game that remains at time 1 12 is the following: The searcher starts 
at one of the ends, say A , with probability 1/3, and at the other end with probability 2/3. 
The probability at each end is known to the hider. Call the value of this game (starting 
at time 0) v ' . For example, if the hider starts to the left of 0 and is not caught by time 
1/2, he knows that at this time the hider is at A with probability 1/3. If he started to the 
right of O then B will be the low probability starting point in the subgame. Assuming 
the searcher plays optimally in the subgame, this strategy ensures an expected capture 
time (and hence an estimate of V) given by 



(4ii> 



The estimation of v' is carried out in the following. 

Lemma 4.12 v' < j|. 

Proof. Assume that A is the low (1/3) probability end. Suppose the searcher starts 
by going at speed 1 from his starting point (A or B ) equiprobably along either the top 
pair of arcs (a\b[) or the bottom pair of arcs {(iib^), arriving at time 1 at the other end 
( B or A ). Regardless of subsequent play, the optimal response of the hider is to meet a 



a, to. 




Figure 4.2. 
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possible searcher path from the low probability (A ) side of O at time just before 5 . If he 
has not been captured by time ^ , then it is even more likely than at the beginning that the 
searcher started at B, so he should go close to A, meeting a searcher path starting from B 
just before time 1. If he has not been captured by time 1, the remaining game is the same 
as the original one, with the starting probabilities on the ends reversed, so with optimal 
play the expected remaining capture time is again v' . Consequently, the hider will be 
captured just before time ^ with probability (j)(|) = g and just before time 1 with 
probability (j)(j) = g . With the remaining probability ^ the same game will be played 
again (with A and B reversed) starting at time 1. Consequently, we have the estimate 

^(1)1 + (!)■ + (1)0 + A- 

Solving the above inequality for v' gives 



Substituting this value oft/ into the inequality (4.11) gives v' < Writing this in 
terms of the length p of the network, we obtain the desired result. 

Theorem 4.13 The value of the search game with mobile hider on the figure eight 
network with the searcher starting at the center satisfies 



This can be achieved by a strategy that is consistent with the non-loitering assumption. 

Remark 4.14 Note that if the starting point of the searcher is at a known antipode, 
say A, then the value of the search game is 2 = p. (The hider can use the e-optimal 
strategy for the graph that consists of the two arcsa\ U h\ and a 2 U 62 with a random 
small £. Since the non-loitering assumption prevents the searcherfrom ambushing at 
O, this strategy is as effective as in the two arcs game as described in Section 4.2.) 
The advantage of starting at 0 is that at time j the hider does not know whether the 
searcher sweeps from A to B or from B to A. 

Can the expected capture time be reduced below j|/z? Yes, if we do not limit the 
searcher by the non-loitering assumption and allow the searcher to ambush at the node 
O, with some (small) probability for some time interval. The fact that the searcher can 
actually gain from ambushing can be deduced as follows. It is easy to see that, against s, 
the hider’s best response involves crossing 0 once just before t = 2 and an additional 
time before t = 3. Now, if the probability that the searcher ambushes at 0 for the 
time interval [2,3] (and later continues to use s) is small enough then the hider’s best 
response is still to cross O during 2 < 1 < 3. Thus, if the hider is still at large at t = 2, 
then the (conditional) capture time if the searcher ambushes is smaller than if he uses s . 
So the expected capture time for a mixture of the above strategies is smaller than jg/z. 

We can effectively remove the limitation produced by the non-loitering assumption 
(note that an ambush could be useful only at 0 ) by considering the spectacles network 
given by Figure 4.3. 
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Instead of the central node, O, of the figure eight graph, we have here a very 
short central interval. Thus, for the spectacles network, even with the non-loitering 
assumption, the searcher can “ambush” by slowly moving in the central interval, say 
during 2 < / < 3. 

Alpern and Asic (1985) conjectured that the spectacles graph (with an infinites- 
imal central interval) achieves the minimum possible v/fj, under the non- loitering 
assumption. 

They also conjectured that when loitering is allowed, the quickest graph to find the 
hiderin is then n-leafed clover (n —*■ oo) consisting of a central vertex, where the searcher 
starts, to which n loops of equal length are attached. Both of the above conjectures are 
still open problems. 

Remark 4.15 Analyzing the effect of ambush strategies in a star consisting of a central 
vertex O to which n line segments of equal length are attached , seems a “neat” problem. 
However, even for this simple graph, finding the optimal search strategy, allowing 
ambushing at the central node, seems quite complicated. (A reasonable search strategy 
is, for each stage, either to visit a segment among those that were not inspected lately, 
or, with a small probability, ambush at O). 

Remark 4.16 Alpern and Asic (1986) investigated the effect of removing the non- 
loitering assumption in the Ic-arcs game (see Section 4.2). They showed that allowing 
the searcher to ambush at vertices (O or A) in the Ic-arcs game reduces the value below 
kfor k > 3. Ifk = 3, then the value remains 3, but the optimal strategies are not 
“uniformly optimal" (see Remark 4.9). The analysis is quite complicated (probably 
because of the lack of uniformly optimal strategies). 

It should also be noted that if we allow the searcher an arbitrary starting point, then 
the lower bound for the value decreases to p/2 (= lower bound for an immobile hider). 
This lower bound, v = f.i/2, is obtained in the case that Q is any line segment [a, b] 
and the searcher can choose his starting point. (The searcher can guarantee an expected 
capture time not exceeding (b - a)/2 by choosing S(0) = a with probability | and 
S(0) = b with probability 1/2, and moving with maximal speed to the other end of the 
segment, while the hider can guarantee an expected capture time greater, or equal to 
( b-a ) / 2 by choosing a point H that is uniformly distributed in [a, b], and staying at H.) 
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4.5 The Princess and the Monster in Two Dimensions 

4.5.1 General framework 

In this chapter, the domain of search is a two (or >2) dimensional convex 1 region in 
a Euclidean space. The searcher can move along any continuous trajectory that starts 
from the origin O. The hider can choose his initial location and an arbitrary continuous 
trajectory starting from that point and can move along his trajectory with maximal 
speed w. In contrast to the search on k arcs, we will not require that w > 1 but assume 
that w is not too small. (The exact formulation of this condition will be presented in 
Section 4.5.3.) 

The notations that will frequently be used in this chapter are /./, the Lebesgue measure 
of relevant search space; D, its diameter; and p, the maximal rate of discovery of the 
searcher (which is equal to 2 r for two-dimensional sets, n r 2 for three-dimensional 
regions, etc.). The radius of detection r will be assumed to be small in relation to the 
magnitude of Q. We shall proceed under the assumption that the detection radius is a 
constant, and the maximal velocity of the searcher is 1, but in Section 4.6 we show how 
to extend the results to the case in which both the radius of detection and the maximal 
velocity of the searcher depend on the location inside Q. 

For the case of an immobile hider in a two-dimensional region, it has been shown 
in Section 3.7 that the value, v, of the search game, satisfies v ~ p/2p. For the case of 
a mobile hider, we show in Section 4.5.2 that the searcher can guarantee an expected 
capture time not exceeding (1 + s)p,/p. The dual result is presented in Section 4.5.3. 
We show there that the hider can make sure that the expected capture time will exceed 
(1 — s)p,/p. Thus, we demonstrate that the value of the Princess and Monster game in 
a multidimensional region satisfies v ~ p/p, independently of the shape of the search 
space, and we present £-optirnal strategies for both players. Specifically, we prove the 
following theorem. 

Theorem 4.17 For a convex two-dimensional region, Q, the value, v, of the search 
game with a mobile hider satisfies 



v = (1+e)^- 

2 r 



where e — * 0 as r — > 0. 

This result can be extended to non-convex regions as well using a mild modifica- 
tion of the optimal strategies, as discussed in Section 4.6.2. We conduct the detailed 
proofs for two-dimensional regions, but in Section 4.6.3 we also show how to extend the 
results to higher-dimensional regions and also discuss some other extensions, includ- 
ing the result that our strategies remain £ -optimal even if a more general cost function 
is used. 



1 In Section 4.6.2 we show that our results also hold for nonconvex regions. We make this assumption in 

order to simplify the presentation. 
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Figure 4.4. 



4.5.2 Strategy of the searcher 

In this section, we shall prove that for any bounded two-dimensional convex 2 region 
Q, if the detection radius r is small, then the searcher has a strategy 5 that makes sure 
that the expected capture time does not exceed ( 1 4- e)fi/2 r, where e is small. 

As an easy “starter” consider searching (for a mobile hider) in a narrow rectangle 
with width 1 and height £ = 2 rk ( k being any positive integer and r very small). Cover 
Q by k parallel strips, each of height 2 r. Consider k sweepers (each of them assigned 
to a different strip) that move with speed 1 in the middle of the strip from one (narrow) 
side to the other in the time segment r < t < z + 1 (see Figure 4.4). 

Then, just as in the k arcs network, at least one ofthese sweepers must meet the hider. 
The problem now is very similar to the k arcs network, except for a small perturbation. 
Assume that the searcher starts from one of the narrow edges. He can choose any 
sweeper (with equal probability) to reach him by time ro = 2 kr andjoin him during the 
time segment r < / < r + 1 . Then, at time rj = ro + 1 + 2 kr, he can (randomly ) join 
another sweeper, etc. Just as in Lemma 4.7, the expected capture time would satisfy 

f < 2kr + £(1 + Irk) = e + k( 1 + £) ~ k = — . 

2r 

We now construct the search strategy for any convex region Q. The search strategy 
s to be considered has the following general structure. The region Q is covered by a set 
of parallel congruent rectangles with height much smaller than the width; a rectangle, 
chosen randomly, is entered and examined by moving N times from one narrow side to 
the other along randomly chosen heights; then another rectangle is randomly chosen, 
etc. The number N should be large enough to “absorb” the effect of the time spent in 
going from one rectangle to another, but on the other hand, N must not be so large 
that too much time is spent in one rectangle. Having this idea in mind we proceed as 
follows. 

Let D Q be the set with minimal area that is a union of the rectangles 
Bi,l = 1, . . . , L, . . where each /?/, with height yj, and width 2a, is parallel to the 
x axis as shown in Figure 4.5. 



2 We have already noted that convexity is not really needed (see Section 4.6.2). 
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Let p be the area of Q and p a be the area of Q a Then 

Ma = (l+0a)M. (4.12) 

where, by the convexity of Q, 

(p a 4 0 as a l 0. (4.13) 

For any Z\, Z 2 € Q„, let d(Z\, Zf) be the minimal length of a path that connects 
Z\ and 2 2 and passes inside Q a . As usual, D will denote the diameter of the relevant 
search space (Q u ). 

We shall give a constructive proof of the following theorem. 



Theorem 4.18 Let r satisfy 


a 2 

r = e— - 


(4-14) 




2D 


and assume that 




Oo 

II 

A 


(4.15) 



Then there exists a search strategy s in Q a such that for any evading trajectory H used 
by the hider, the expected capture time c(s, H) satisfies 

c(s, H) < (1 + 45) = (1 + 48)(1 + < 4 - 16 ) 

We use the following construction. Let 

a = aS 2 . (4.17) 



We first divide each rectangle B/,l = 1, .... L, of size yi x 2 a (see Figure 4.5) by 
horizontal lines, into narrow rectangles, Q m , so that all of these rectangles except 
possibly one, have a width a, while the upper one has a width a' < or, then split 
each rectangle Q m into two halves, each of length a, by a vertical line in the middle 
and denoted by Q m 0 - the left rectangle and Q m 1 - the right rectangle, as depicted in 
Figure 4.6. 
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: 


On, • 



















Figure 4.6. 



Thus, each rectangle Q m j , i = 0, 1 ; m = 1 M, has a height < a. Since fj, a = 
2 ayi (see Figure 4.5), then the number 2 M of such rectangles satisfies 



2M = 2 V"[yv/a'| <2V-+2L<L^+2 
/=l i—\ a aa 



D + a 



aa 



/, + 2 «(D+^\ t / 1 + S\ 

\ v-a ) <*a\ 2) 



( 4 . 18 ) 



by (4.17) and (4.15). 

Let A be a positive integer defined by 

N = 



D 

Sa 



Let y be a random variable y with the density 

fiy) 



2 


for 0 < y < r 


a + 2r 


1 


for r < y < a - r 


a + 2r 


2 


for a — r < y < a 


a + 2r 



= 0 elsewhere. 



( 4 . 19 ) 



( 4 . 20 ) 



The search strategy 5 is composed of independent repetitions of the following step. At 
time? = 0 make arandom choice out ofthe narrow rectangles <2io> Qu, ■ • ■ Qmo< Qm 1 
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such that each rectangle has a probability 1 I2M of being chosen, and also make 

a random choice of N independent random variables y \ y^, where N is given by 

(4.19) and all the y„, n = l, N have the probability density given by (4.20), and 
where a is the height of the chosen rectangle. (In the case of the upper rectangle, with 
heighta' < a, then a' replaces a in (4.20); ifar' < 2r we define y to be identically a' /2.) 
In order to describe the motion of the searcher within Q„,i, we shall use a coordinate 
system with origin at the lower left corner of Q m j as depicted in Figure 4.7. At time 
t - 0, the searcher starts moving as fast as possible to the point (ia, yi ) (i.e., on the 
left vertical edge for i = 0 and on the right edge for i = 1). He rests at that point until 
t - D and then moves with maximal velocity horizontally to the other vertical edge 
(to (a, yj) if i = 0 and to (0, _vi ) if / = 1) and reaches it at time t = D + a, then he 
moves vertically to y = yi, i.e., to the point (a, yf) if i = 0 and to (0, >> 2 ) if i = 1. He 
rests there until t = D + a + a, and at that moment he starts moving horizontally to 
the other vertical edge, etc. 

The important feature of the movement of the searcher, using s , is that at the time 
segments 

r„ =[D + ( n — l)(a + or), D + (« - l)(a + a) + a], n = 1 N, 

he moves, horizontally, along the intervals that join (0, y n ) to (a, y n ). We shall show 
that for this kind of movement, the following proposition holds. 

Proposition 4.19 If the searcher uses s, then, for any hider strategy, the probability p 
of capture during the time segment 0 < t < D + N (a + a) satisfies 




Proof. Consider a specific time segment r„ given by 

r„ = {/: D + (n - l)(a + a) < t < D + (n — l)(a + a) + a) (4.22) 



where 1 < n < N. 
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We shall distinguish between two cases. If n is odd and i = 0 or if n is even and 

i - 1, then for any t € t„, we define G„„- (/), m = 1 M, as the vertical line 

segment of width a, which has a distance d(t) from the left vertical edge of where 

d(t) = t — (D + (n — l)(a + a)) (4.23) 



so that G„„(/) is given by 



G m ,(0 = [(d(t), y),0 < y < a'). (4.24) 

If n is even and i - 0 or if n is odd and i = 1, then for any t e z„, we define 
G m i(t), m = 1, . . . , M, as the vertical line segment of width a', which has a distance 
d(t) from the right vertical edge of Q m j, i.c., in this case 



G,„;(r) = {(a - d(t), y),0 < y < a'}. (4.25) 



In both cases we define 



M 1 

U U G '»w- 

» i = 1 /=0 



(4.26) 



By an argument similar to those used in proving the sweeping lemma (4.6) of Section 4.2, 
one can show that the following lemma holds. ■ 



Lemma 4.20 If H is any trajectory used by the hider, then for any n there exists at 
least one time instant t n € V„ (see (4.22)) such that the point H (t„) visited by the hider 
at time t„ satisfies H(t„) e G(t„) (see (4.26)). 

It follows from the lemma that for any n there exist m and i such that 



H(t„) e G m , •(/„). 



(4.27) 



Let l,„i(n) = 1 if (4.27) holds and zero otherwise, and define 

N 

Inn = J2 '""W- ( 4 - 2 8) 

n=l 

Then it follows from the foregoing discussion that 

M 1 

J2 L / '»' - N ( 4 - 2 9) 

m= 1 i=0 

Assume that the searcher chooses the rectangle Q m \ . If Imi (n) = 1 , then it follows from 
the definition of the random variable y n that the probability of capture during the time 
segment z n is greater than or equal to the probability that at the time t„ (see (4.27)) the 
random interval Y„ given by 



y, I = [yn - r, y n + r] n [0, a'] 




64 



BOOK!. SEARCH GAMES 



will contain the v-th coordinate of H(t n ). Now, it follows from (4.20) that for any point 
b in the interval fO, a'\ the probability that b e Y n is greater than or equal to 

2 r 2 r 

> . 

a’ + 2r a + 2r 

Since the random variables >i , . . . , yjv are independent, it follows that the probability 
of capture during the time segment 0 < t < D + N(a + or) is greater than or equal to 

/ 2 r V mi 

(s “ (4 - 28,) - 

Since each rectangle Q,„i, m — 1, . . . , M, / = 0, I, is chosen with probability 
1/2 M, it follows that the probability p of capture during the time segment 0 < t < 
D + N(a + a) satisfies 



= 1- — f ) . 

2M “ \a +2r J 

m= I i=0 x 



Since for any nonnegative integers J, K 

/ iv 

1 - 



a + 2r 







it follows that 



(^T*) + {^h?) ” 1 (a^2r) 



so that (by (4.29)) 

M 1 



rn=l 1=0 

\a + 2rj 



Hm= I 5Zi=0 bn 



< 2M - 1 + 

Thus, it follows from (4.30) and (4.31) that 



” - Tm (‘ " (^) )' 



( 4 . 30 ) 



( 4 . 31 ) 
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We can now proceed with the proof of Theorem 4.18. First we note that (4.14), 
(4.15), (4.17), and (4.19) imply that 

2 r _ 2«V 2 a > 8 

a ~ 2 D8 2 a D ~ N' 

Thus by (4.21), the probability p of capture in the time segment 0 < t < D + N(a +Qf) 
satisfies 



>_L ( x l UJLA __! 1 J) 

~ 2M\ (1 + (2 r/a)) N )~ 2 M\ (1+ (8/N)) N ) 



(1 4 (2 r/a)) N 

> _L(\ L.\ = 

~ 2M V 14-5/ 



2M(1 4 5) 



(4.32) 

(4.33) 



Now, since the search strategy s is composed of independent repetitions of the step 
described for the time segment 0 < / < D 4 N(a 4 a), then for any hiding trajectory 
//, the probability pk of capture after the time instant / = K(D 4 N (a 4 a)) satisfies 

PK <(!-/>)*• (4.34) 



Thus the expected capture time c(s, H) satisfies (see (2.5)) 

c(s, H)<(D 4 N (a 4 a)) V* PK < (D + N(a ± f f)) (by (4.34)) 

K= 0 * 

i±i) 0 „( 1 + 4 + 2) (by(4 . 32)) 

8 \ aN a ) 

< ^ (j 4 |) + l)fl(l+« + 8 2 ) = (by (4. 17)— (4. 19)) 

=^(' + 0 <i+<) ( i+ ^) <i+i+<2) 

< -(1443) = gi(l4 43) (4.35) 

(a*/D)£ 2 r 

(by (4.14) and (4.15)). This completes the proof of Theorem 4.18. 

The following corollary is important for establishing the £ -optimality of.v for a more 
general cost function (see Section 4.6.8). 

Corollary 4.21 If the searcher uses the strategy s described in the proof of Theorem 4.18, 
then for any hiding trajectory H the probability that the capture time T exceeds t satisfies 

Pr(T > t) < (l + e)exp 



I, = 



t 

D 4 N(a 4 a) 



with e — » 0 as r — * 0. 
Proof. Let 
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Then it follows from (4.32) and (4.34) that if the searcher uses then for any H 
Pr(T >t)< Pr{J > (D + N(a + a)) /,) 

SI, 



<(l-p) , '<(l ) < exp ( — ) 

- V 2A/(1 + S) J - 2Af(l + «)/ 

( tS S ) 

V 2M(l+S)(I) + N(a + a)) + 2M(l + S) ) 



< exp 



by (4.12), (4.14), (4.15), and (4.17)-(4.19): 



2 rt S 

p(l+<p a )(\ +48) + 2M(\+8) 

Since both 8 — > 0 and <j> a 0 as /• — ^ 0, we obtained the desired result. ■ 

It should be noted that if the searcher uses the strategy .v presented in the proof, then 
apart of his trajectory, which is near the boundary of Q a might be slightly outside of the 
original search space Q. However, if Q is convex, then we can make a slight modification 
in s and introduce a search strategy s that uses trajectories entirely inside Q and still 
guarantees that the result (4.16) of Theorem 4.18, holds. The strategy s is defined as 
follows. If the chosen rectangle Q m j is inside Q, then the movement is identical to the 
one in s. However, if a part of Q m j is outside Q, as depicted in Figure 4.8, we make the 
following modification. 

Assume that in the strategy s the searcher moves from the point (0, y n ) to ( a , y n ) in 
the time segment r n (see (4.22)) and then moves from the point (a, y n + 1 ) to (0, y n +l) 
in the time segment r„+i. The movement in s is as follows. The searcher moves from 
the point (x n , y n ) to (x^, y„ ) (see Figure 4.8) in the time segment 

D + (n- 1 )(a + a)+;t„ < t < D + (n - 1 )(a + a) + x'„, 





Figure 4.8. 





